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(57) Abstract: A novel method of pyrophosphorolysis activated polymerization (PAP) lias been developed. In PAP, pyrophospho- 
rolysis and polymerization by DNA polymerase are coupled serially for each amplification by using an activatable oligonucleotide 
P* that has a non-extendible 3'-deoxynucIeotide at its 3' terminus. PAP can be applied for exponential ampHfication or for linear 
ampIiiScation. PAP can be applied to amplification of a rare allele in admixuire with one or moxe wild-type alleles by using an acti- 
vatable oligpnucleotide P* that is an exact match at its 3' end for the rare allele but has a mismatch at or near its 3' terminus for the 
wild-type allele. PAP is inhibited by a mismatch in the 3' specific sequence as far as 16 nucleotides away from the 3' terminus. PAP 
can greatly increase tiie specificity of detection of an extremely rare mutant allele in ttie presence of the wild-type allele. Sipecificity 
results from both pyrophosphorolysis and polymerization since significant nonspecific amplification requites die combination of 
mismatch pyrophosphorolysis and misincorporation by the DNA polymciase, an extremely rare event. Using genetically engmeered 
DNA polymerases.^reat3y improves the efficiency of PAP. 
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TITLE OF THE INVENTION 

PYROPHOSPHOROLYSIS ACHVATED POLYMERIZATION (PAP) 
BACKGROUND OF THE INVENTION 

[OOQl] This invention relates to nucleic add polymerization and amplification. In particular, it 
rdates to a novel and geaeiral method for nucleic acid amplification, in which 
pyrophosphorolysis and polymerization are serially-coupled. The method has been ad^ted for 
aUele-specijBc amplification and can greatly increase the specificity to detect an extremely rare 
allele in the presence of void-type alleles. We refer to the method as pyrophosphorolysis 
activated polymerization (PAP). 

[0002] The publications and other materials used hereia to illuminate the background of the 
invention or provide additional details respecting the practice, are incorporated by refeuencei, and 
for convenience are respectively grouped ia the appended Bibliography. 

[0003] Multiple methods for detecting mutations present in less than 10% of cells (i.e. rare 
alleles) have been developed, iacluding PGR amplification of specific alleles (PASA), peptide 
nucleic acid (PNA) clarapiag bloclter PGR, allele-specific competitive blocker PGR, mismatch 
amplification mutation assay (MAMA), restriction fragment-length pol3morphism (RFLP)/PCR 
(Parsons and Heflich, 1997) and QE-PCR (Ronai and Minamoto, 1997). These methods: i) 
amplify the rare allele selectively, ii) destroy the abundant wild-type allele, or iii) spatially 
separate the rare allele firom the wild-type allele. The specificity achievable under typical 
research/clinical conditions is 10"^ (Parsons and Heflich, 1997), although a few pubhcations 
reported higher i^ecificily of detection (Poutzand and Cemtti, 1993; Knoll et al., 1996), These 
meQiods either do not generally achieve the higher specificity or are not suitable for roufine 
analysis. 

[0004] A robust m^od of detecting one mutant allele in 10'*-! 0^ wild-type alleles -would be 
advantageous for many applications including detecting minimal residual disease (recurrence 
after remission or rare remaining cancer cells in Ijmph nodes and other neighboring tissues) and 
measurement of nurtation load (the fiBquesncy and pattem of somatic mutations present in 
normal ti^es). Individuals with a hi^ mutation load may be at increased ri^ for cancer due to 
either environmental exposure or endogenous defects in any of himdreds of genes necessary to 
maintain the integrity of the genome. For Hiose individuals found to have a high mutation load, 
clues to etiology can be obtained by defining the mutation patteani. 

[0(M»5[ There are many DNA sequaidng methods and their variants, such as the Sanger 
sequencing using dideoxy termination and denaturing gel electrophoresis (Sanger et al., 1977), 
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Maxam-Gilbert sequenciag using chemical cleavage and denaturing gel electrophoresis (Masam 
and Gilbert, 1977), pyro-sequencing detecting pyrophosphate (PPO released during the DNA 
polymerdse reaction (Ronaghi et al., 1998), and sequencing by hybridization (SBH) using 
oligonucleotides (Lysov et al., 1988; Bains and Smith, 1988; Dnnanac et al., 1989; Khrapko et 
al., 1989; Pevzner et al., 1989; Southern et al., 1992). 

[0006] There are multiple gel-based methods for scanning for unknown mutations including 
single stranded confomiation polymorphism (SSCP) and the SSCP-hybrid methods of dideoxy 
fingerprinting (ddF), restriction endonuclease fingerprintmg (REF), and Detection Of Virtually 
AU Mutations-SSCP (DOV AM-S), denaturing gradient gel electrophoresis (DGGE)^ denattiring 
HPLC (dHPLC) chemical or enzymatic cleavage (Sadcar et al.^ 1992; Liu and Sommer, 1995; 
Liu et al.i 1999; Myers et al., 1985; Cotton et al., 1988; Liu et al., 1999; Buzin et al., 2000; 
Spiegehnan et al., 2000). DOVAM-S and chemical cleavage reactions have been shown in 
blinded analyses to identify essentially all mutations (Buzin et al., 2000). dHPLC, which is 
based on reverse phase chromatography, also may identify essentially all mutations under 
appropriate conditions (O'Donovan et al., 1998; Oefiier and Underbill,, 1998; Spiegehnan et al., 
2000). Efforts are under way to develop general scanning methods with Mgjiea: througliput. 
[0007J Sequencing by hybridizatipn (SBH) is being adapted to seantdhg or resequeacing for 
unlcaown mutations on mieroanrays (Soufhem, 1996). This contiimies to be a promismg area of 
intense study. However it is not possible as yet to detect most microinsertions and deletions with 
this approach and the signal to noise ratio for single base changes precludes detection of 5-10% 
of single nucleotide changes OHacia, 1999). Alternative approaches warrant exploration. 
[0008] It is becoming increasingly apparent that in vivo chromatin structure is crucial for 
mammalian gene regulation and development. Stable changes in cbxomatin structure often 
involve changes in methylation and/or changes in histone acetylation. Somatically heritable 
changes m chromatin structure are commonly called epigenetic changes (Russo and Riggs, 
1996) and it is now clear that epigenetic "mistakes" or epimutations are frequently an important 
contributing factor to the developnaent of cancer (Jones and Laird, 1999). 
[0009] One of the few methods for. assaying in vivo chromatia structure, and the only method 
with resolution at the single nucleotide level, is hgation-mediated PGR (LM-PCR) (Mueller and 
Wold, 1989; Pfeifer et al., 1989) and its variant of terminal transferase-mediated PGR (TD-PCR) 
(Komura and Riggs, 1998). Many aspects of dnromatin structure caa be determined by LM- 
PGR, such as the location of methylated cytosine residues, bound transcription factors, or 
positioned nucleosomes. It is readily apparent tiiat LM-PCR works better with some primer sets 
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than with others. Thus, it is desired to develop a more robust method of measuring chromatin 
structure. 

[0010] Thus, it is an object of the present invention to develop alternative methods for 
amplification of DNA, for sequenciag DNA and for analysis of chromadn structure. This object 
is accomplished by the use of the novel pyrophosphorolysis activated polymerization (PAP) as 
described herein. PAP has the potential to enhaace dramatically the specificity of the 
an^hfication of specific alleles, for resequencing DNA and for chrofnatui structure analysis, 

SUMMARY OF THE INVENTION 

[0011] The invention is a pyrophosphorolysis activated polyniedzation (PAP) method of 
syiiihesizdng a desired nucleic acid strand on a nucleic acid template straiid. The method 
comprises the following steps carried out serially, 

{0012] (a) AnneaHng to the template strand a complementary activatable oligonucleotide P*. 
This activatable oUgoimcleotide has a non-extendible 3' terminus that is activatable by 
pyrophosphorolysis (hereinafter referred to as a non-extesndible 3' termiaus or a 3' non- 
extendible end or a noii-extetidible 3' end). The non-extendible 3' terminus (or end) is a 
nucleotide or nucleotide analog which has the capacity to fonn a Watson-Crick base bah with a 
complemeirtaiy nucleotide and Which lacks a 3' OH ciapable of being extended by a nucleic acid 
polymerase. In one embodinieirt, the non-extendible 3' terminus may be a non-escteaidible 
3'deoxynucleotide, such as a dideoxynucleotide. In a second embodiment, the flon-estendible 3' 
terminus may be a chesttiically modified nucleotide lacldng the 3' hydroxy] group, such as an 
acycldnucleotide. Acycldnucleotides substitute a 2-hydro3cyethoxymethyl group for the 2'- 
deoxyribofuranosyl sugar normally present in dNMPs. In other embodimeaats, the non- 
extendible 3' terminus may be other blockers as described herein, hi one embodiment, the 
activatable oligonucleotide P* has no nucleotides at or near its 3' terminus that mismatch the 
corresponding nucleotides on the template strand. In a second embodiment, the activatable 
oligonucleotide P* has a mismatch at or within 16 nucleotides of its 3' terminus with respect to a 
corresponding nucleotide on the template strand. The terminal 3'-deoxynucleotide is hybridized 
to the template strand when the oligonucleotide P* is amealed. 

[0013] (b) Pyrophosphorolyzing the annealed activatable oligonucleotide P* with 
pyrophosphate and an enzyme that has pyrophosphorolysis activity. This activates the 
ohgonucleotide P* by removal of the hybridized non-extendible 3' terminus. 
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[0014] (c) Polymerizing by extending the activated oligonucleotide P* on the template stnand in 
presence of four nucleoside triphosphates of their analogs and a nucleic acid polymerase to 
synthesize the desired nucleic acid strand. 

[0015] The PAP method can be applied to amplify a desired nucleic acid strand by the following 

additional steps. 

[0016] (d) Separating the desired nucleic acid strand of step (c) from the template stratid, and 
[0017] (e) Repeating steps (a)-(d) until a desired level of amplification of the desired nucleic 
acid strand is achieved. 

[0018] In a prefeired aspect, the PAP method as described above is ^lied to alleleipecific 
amplificatiaa (PAP-A). la tins application, the nucleic acid template strand is a smse or 
aatisense strand of one allele and is present in admixtiHe with the conesponding (sense or 
anJisense) nucleic acid strmd of the second allde (the allelelic strand). The. activatable 
oligonucleotide P* has at least one nucleotide or analog at or near its 3' terminus, e.g., within 16 
nucleotides of the 3' teimirius, that mismatches the corresponding nucleotide of the allelic strand. 
Because of the mismatch, in step (a) of the PAP method the non-extemfible 3' teimmns of 
oKgonucIeolide P* is not substantially hybridized to the allelelic strand. lii step (b) the 
pjrophosphorolysis does not substantially ranove the non-^hybridized non-extemdible 3' 
terminus fiom the activatable oligonucleotide P* annealed to the allelic strand. In step (c) the 
oligonucleqlide P* is not substantially extended by polymerization on the allelic strand. As a 
result, ttie desired nucleic acid strand synthesized on fee template strand is amplified 
preferentially over any nucleic acid strand synthesized on the allelelic strand. 
10019] In a second preferred aspect, the PAP-A method described above can be performed 
bidirectionally (Bi-PAP-A). Bidirectional-PAP (Bi-PAP) is a novel desi^^ that preferably uses 
two opposing pyrophosphorolysis activatable oligonucleotides (P*) with one nucleotide overlap 
at fheu: 3' tenrnni. Thus, in Bi-PAP, PAP-A is performed with a pair of opposing activatable 
oligonucleotide P*s. Both the downstream and upstream P*s are specific for the nucleotide of 
interest at the 3' tennini (e.g., an A:T base pair). In the initial round of amplification from 
genomic DNA, segments of undefined size are generated. In subsequent rounds, a segment 
equal to the combined lengths of the oligonucleotides minus one is amplified exponentially. 
Nonspecific amplification occurs at lower frequencies because this design eliminates 
misincorporation error from an unbloclced iipsfream. The P*s may be 30-60 nucleotides for 
most efficient amplification. 



wo 03/095664 



PCT/US03/14556 



5 

[0020] The PAP method can be used to amplify either ItNA or DNA. "When used to amplify 
DNA, the activatable oligonucleotide P* may be a 2'-deoxyoIigoniicleotide, the non-extendible 
3' terminus may be, e.g., a 2',3'-dideoxyiiucIeotide or an acyclonucleotide or other blockers as 
described herein, tlie four nucleosidt) triphosphates are 2'-deoxynuclcoside triphosphates or their 
analogs, and the nucleic acid polymerase is a DNA polymerase. The DNA polymerase used in 
step (c) can also be the enzyme having pyrophosphorolysis activify used in step (b). Preferred 
DNA polym^es having pyrophosphorolysis activify are thermostable Tfl, Taq, and geaeficaliy 
engineered DNA polymerases, such as AmpHra^Fs and ThermoSequenase™. These 
geaietically engineered DNA polymerases have the mutation F667Y or an equivalent mutation in 
theii- active sites. The use of genetically engmeered DNA polymerases, such as AmpIijTa^Fs 
■ and ThennoSequsoase™ greatly improves the efficiency of PAP. These Family I DNA 
polymerases can be used when the activatable oligonucleotide P* is a 3 ' dideoxynucleotide or an 
acyclonucleotide. When the activatable oligonucleotide P* is an acyclomicleotide. Family H 
archaeon DNA polymerases can also be used. Examples of such, polymerases include, but are 
not limited to, Vent (exo-) and Pfa (exo-). These polymeiases efScientiy amplify 
S'acyclcirucleotide blocked P*. Two or more polymerases can also be used in one reaction. If 
the template is RNA, the nucleic acid polymerase may be RNA polymeirase, reverse 
transcriptase, or their vatjairts. The activatablei oligonucleotide P* may be a ribonucleotide or a 
2'-deoxymicleotide, The non-extendible 3' terminus may be a 3' deo(xyribonucleotide or an 
acyclonucleotide. The four nucleoside triphosphates may be ribonucleoside triphosphates, 2' 
deoxynucleoside triphcwsphates or their analogs. For convaiieoice, the description that follows 
uses DNA as the template. However, RNA is also included, such as described for the present 
aspect. 

[0021] Amplification by tlie PAP method can be linear or exponentiaJ. Lmeffl: amphfication is 
obtained when the activatable ohgonucleotide P* is the only complementary ohgonucleotide 
used. EjqKmential amplification is obtained when a second opposing oligonucleotide, which may 
be a P*, is present tliat is complementary to the desired nucleic, acid strand. The activatable 
ohgonucleotide P* and the second oligonucleotide flank the region that is targeted for 
amplificatiGn. In step (a) the second ohgonucleotide anneals to the separated desired nucleic 
acid strand product of step (d). In step (c) polymerization extends the second ohgonucleotide on 
the desired nucleic acid strand to synthesize a copy of the nucleic acid template strand, hi step 
(d) the synthesized nucleic acid template strand is separated from the desired nucleic acid strand. 
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Steps (a) through (d) are repeated until the desired level exponential amphfication has been 
achieved. 

[0022] In the PAP method, a mismatch between the activatable oHgonucleotide P* and the 
template sfand results in no substantial amplification, if the mismatch ocGars in the 3' specific 
subsequence of P* at the 3' terminus of P* or within 16 nucleotides of the 3' terminus of P*. 
This lack of amplification for such mismatches in the 3' specific subsequence of P* provides 
fom- billion different and specific oHgonucleotides with one base substitution resolution. 
[0023] In a prefeired aspect, the PAP mefliod is used for exponential amplification of a rare, 
mutant allele in a mixture containing one or more wild-type alleles. Strands of Ihe alleles ate 
separated to provide single-stranded nucleic acid, and Ihen the following steps are carried out 
serially. 

[0024] (a) Annealing to the sense or antisense strands of each aUele a complem^ 
activatable 2'-deoxyoUgoiiacleotide P* that has a non-extendible 3' terminus. The non- 
extendible 3' tenninus may be, e.g., a non-extendible 2',3'-dideoxynucleotide or an 
acyclonucleotide. P* has no 2'-deoxynucIeotides at or near its 3' terminus that mismatch the 
coiresponding 2'-deoxynucl6otides on the mutant strand, but has at least one 2'-deosynucleotide 
at or near its 3' terminus that mismatches the corresponding 2'-deoxynucledtide on the wild-lype 
stand. Consequently, the non-extendible 3' terminus is hybridized to tiie mutant strand but not to 
the wild-lype strand when the oHgonucleotide P* is annealed.- Simultaneously, a second 
2'^eoxyoligQnucleotide fiiat is complemeaitaiy to Ihe anti-paraUel strands of each allele is 
annealed to the anti-parallel strands. The activatable 2'-deoxyoIigonucleotide P* and the second 
2'-deoxyoHgonucleotide flank the region of the gene to be amplified. 

[0025] (b) Pyrophosphorolyzing Ihe activatable 2'-deoxyoHgonucleotide P* that is annealed to a 
mutant stand with pyrophosphate and an enzyme that has pyrophosphorolysis activity. This 
activates the 2'-deoxyoUgomxcleotide P* that is annealed to the mutant strand by removal of the 
hybridized non-extendible 3' temiinus. It does not substantially activate the 
2'-deoxyoligonucleotide P* when it is annealed to the mutant strand because the non-hybridized 
non-extendible 3' terminus is not substantially removed by the pyrophosphorolysis. 
[0026] (c) Polymerizing by extending the activated oligonucleotide P* on the mutant strand in 
presence of four nucleoside triphosphates or tiien analogs and a DNA polymerase and extending 
the second 2'-deoxyoligonucleotide on both mutant and wild-type anti-parallel strands. 
[0027] (d) Separating the extension products of step (c); 
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[0028] (e) Repeating steps (a)-(d) until the desired level of amplification of the mutant allele has 
been achieved. 

[0029] The activatable 2'-deoxyoligonucleotide P* is annealed to tlie antisense strands of the 
alleles and the second 2'-deoxyo]igonucleotide is annealed to tlie sense strands, or vice versa, 
[0030] Steps (a) to (c) of PAP can be conducted sequentially as two or mor^ temperature stages 
on a ihermocycler, or they can be conducted as one temperature stage on a thennocycler. 
[0031] Nucleoside triphosphates and 2'-deoxynucleoside triphosphates or their chemically 
modified versions may be used as substrates for multiple-nucleotide extension by PAP, i.e., 
when one nucleotide is incorporated the extending strand can be fiirtheT extended. 
2',3'-dideoxynucleoside triphosphates, their chemically modified versions, acyclonucleotides or 
other blocked nucleotides which are terminators for fiirther exteiision may he used for single- 
nucleotide extensidn. 2',3'-dideGxymicleoside tr^hosphates naay be labeled with radioactivity or 
dye for diJBfereaitiation from the 3' temainal dideoxynucleotide, if present, of oligomicleatide P*. 
Mixtures, of nucleoside triphosphates or 2'-deo3syniicleotide triphosphates or their analogs, and 
2',3'-dideoxynttcieoside triphosphates or their analogs may also be used. 
[00»] PAP can be used in a novel method of DNA sequence determination. In PAP, 
pyrophosphoroljreis and polymerizatibn by DNA polymerase are coupled serially by using P*, 
an oligonucleotide contaiiung a non-extendible 3' terminus. ITie norn-extendible 3' terminus may 
be, e.g., a non-extendible 3'-deoxynucieotide or an acyclonucleotide. This principle is based on 
ttie specificity of PAP and in turn cm the base pairing specificity of the 3" specific subseqpience. 
This property of the 3' specific subsequence can be appHed to scan or resequence for unknown 
sequence variants, to determine de novo DNA sequence, to compare two DNA sequences, and to 
monitor gene eixpression profiling in large scale. A P* array is possible m these methods. That 
is, each of the P=^s can be immobilized at an individual dot or a solid support, thus allowing all 
the PAP reactions to be processed in parallel. 

[0033] Thus in one aspect, the PAP method is used for scanning or resequencing unknown 
sequence variants within a predcteimiued sequence by carrymg out the following steps serially. 
[0034] (a) Mixing under hybridization conditions a template strand of the nucleic acid with 
multiple sets of four activatable oligonucleotides P* which are sufficiently complementary to the 
template strand to hybridize therewith. Within each set the oligonucleotides P* differ, from 
each other in having a different non-extendible 3' terminus, so that flie non-extendible 3' 
tenninus is hybridized to the teanplate strand if the template strand is complementary to the non- 
extoidible 3' teiminus. The number of sets corresponds to the number of nucleotide in the 
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sequence. The non-extendible 3' terminus may be, e.g., a non-extendible S'-deoxynucleotide or 
an acyclonucleotide. 

[0035] (b) Treating the resulting duplexed P*s with pyrophosphate and an enzyme that has 
pyrophosphorolysis activity to activate by pyrophosphorolj^is only tiiose oUgonucleotidea P* 
which have a non-extendible 3* terminus that is hybridized to the teoaplate strand. 
[0036] (c) Polymerizing by extending the activated oligonucleotides P* on the template straad 
in ptreseace of four nucleoside triphosphates or their analogs and a nucleic acid polymerase. 
[0037] (d) Separating the nucleic acid strands synthesized in step (c) fiom the template strand. 
[0038] (e) Repeating stqps (a)-(d) until a desired level of amplification is achievec^ and 
[0039] (f) Arranging the nucleic acid sequence in order by analyzing overlaps of oligonuclotides 
P* -fttat produced amptifications. 

[0040] In a second aspect, the PAP method is used for detemiimng de novo the sequence of a 
nucleic acid by canying out the following steps serially. 

[0041] (a) Mixing under hybridization conditiohs a template strand of the nucleic acid with 
multiple activatable oligonucleotides P*. AU of the oligonucleotides P* have the same number n 
of nucleotides as the template and constitute collectively all possible sequences having n 
nucleotides. All of the oligonucleotides P* have a non-extendible 3' teoaninus. The non- 
extendible 3' terminus may be, e.g., a non-extendtble S'-deoxynucleotide or ah acyclonucleotide. 
Any oligonucleotides P* that are sufficiently complementary will hybridize to the template 
strand. The non-extendible 3' termiaus will hybridize to the template strand only if the template 
strand is complementary at the position corresponding to the 3' terminus. 
[0042] (b) Treating the resulting duplexed P*s with pyrophosphate and an enzyme that has 
pyrophosphorolysis activity to activate only those hybridized ohgomicleotides P* which have a 
non-extendible 3' tenniaus that is hybridized to the template strand, by pyrophosphorolysis of 
those hybridized non-extendible 3' termini. 

[0043] (c) Polymerizing by extending the activated oligonucleotides P* on the template strand 
in presence of four nucleoside triphosphates or their analogs and a nucleic acid polymerase. 
[0044] (d) Separating the nucleic acid strands synthesized in step (c) from the template strand. 
[0045] (e) Repeating steps (a)-(d) until a desired level of amplification has been achieved, and 
[0046] (f) Determining the sequence of ohgonucleotides P* that produced amplifications, then 
arranging the nucleic acid sequence in order by analyzing overlaps of these oligonucleotides. 
[0047] PAP can also be used to study chromatin structure analogously to ligation-mediated VCR. 
(IM-PCR) by carrying out the following steps serially. LM-PAP has been used for the 
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detmninatioii of primary nucleotide sequence, cytosine methylation patterns, DNA lesion 
formation and repair and in vivo protein-DNA footprints (Dai et al., 2000; Mueller and Wold, 
1989; Pfeifer et al., 1989; Pfeifer et al, 1999; Becker and Grossman, 1993). Ligation-mediated 
PAP (LM-PAP) involves cleavage, primer extension, linker ligation and PAP tliat can be applied 
for analysis of in vivo chromatin stnicture, such as, methylated state of chromosomes, and for 
other nucleic acid analysis as for LM-PCR, 

[0048J The nature of LM-PAP is that the template is synthesized before PAP, such as by 
ligation reaction or by extension using terminal transferase. PAP may be any type of PAP: with 
only one P*, with two opposing oligonucleotides where at least one is P*, Bi-PAP, matched 
PAP, mismatched PAP, and so oil Thus, at its simplest, LM-PAP is the application of PAP to a 
presynfhsized template. LM-PAP may be peifonned by steps (i), (ii), (iii), (iv) (v), by 
steps (i), (ii), (iii) and (vi), by steps (ii), (iii), (iv) aod (v) or by stq)s (ii), (iii) and (vi), where the 
steps are as follows. 

[0049] (i) The cleavage occurs chetnically, enzymatically or naturally to "breakdown" nucledc 
add strands. The nucleic add usually is genomic DNA timt may have lesions or mcks produced 
in vivo, 

[0050] (ii) The oligonucleotide PI is genjerspecific and its extension indudes: 1) annealing to 
the template strand a substantially pompiemsntary oligonucleotide; 2) ejctending the 
oligonucleotide on the template strand in the preseaice of nucleoside tripliosphates or their 
analogs and a nucldc acid polymerase, the extension "runs off at the deavage site on the 
teai5)late strand. Steps 1) and 2) may be repeated. 

[OOSl] The primer extension may be replaced by a P* exteusion, in which the above PAP is 
performed with only one activatable oligonucleotide P*. 

[00S2] (iii) The linker ligation step includes ligation of a linker to the 3' teiminus of the 

synthesized nucleic acid strand. The linlcer ligation step may be rqplaced by a taminal 

transferase extension that is non-template dependent polymerization and an extra nucleie acid 

sequence is added to the 3' terminus of the synfliesized nucleic acid strand. 

[0053] (iv) PGR is performed with a second gene-specific oligonucleotide (P2) together with an 

oligonucleotide specijSc for the linker or the added sequence by terminal transferase. 

[0054] (v) A third gene-specific P* (P3) is used to detect the PGR generated fragments. PAP 

method is applied with only one activatable oligonucleotide P*. The extension of the activated 

ohgonucleotide P* "runs off' at the end of the template strand gemerated in IV. The PAP 

method may be applied in an allele-specifio manner. The activatable oligonucleotide P* may 
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contain one or more nucleotides that are not complementary to the template strand. The 
uncomplimentary tmcleotide(s) of P* may locate at the 3' terminus of P*. 
[0055] (vi) Instead of steps (iv) and (v), PAP method can be applied with two opposing 
oligonucleotidey of which at least one is the activatable oligonucleotide P*. The activatable 
ohgonucleotide P*(P3) is gene-specific. The second oligonucleotide is specific for the linker or 
the added sequence by terminal transferase. The second oligonucleotide may be another 
activatable oligonucleotide P* or a regular ohgonucleotide. The PAP mediod may be applied in 
an allele-specific manner. The activatable ohgonucleotide P* (PS) may contain one or more 
nucleotides that are not conqjlementary to the template strand. The imcon^limeotary 
nncleotide(s) of P* may locate at the 3' terminus of P* (P3). 

[0056] The third gene^specific oligonucleotide (P3) is then usually used to label and allow 
visualization of the PGR generated fragments. P3 is labeled at the 5' tecminus with or, more 
recently, with near iafrared fhiorochromes such as IRD 700 or IRD 800 (Li-Cor Inc.) (Dai et aL, 
2000). 

[0057] PAP can be used to detect a target imcleic acid, la one embodiment this mefliod involves 
the following steps: 

[0058] (a) adding to a nucleic acid containing sample an oligonucleotide P*, wherein ttie 
oligonucleotide P* has a non^extendible 3' teonninus, wherein the 3' terminal residue of 
oligonucleotide P* is removable by pyrophosphorolysis, and wherein liie oligonucleotide P* 
amieals to a substantially ccfmplementaiy strand of the target nucleic acid present in the sample; 
[0059] (b) removing the 3' non-extendible tesrminus of the oligonucleotide P* annealed to the 
substantially complementary strand of the target nucleic acid by pyrophosphorolysis to unblock 
the oligonucleotide P* to produce an unblocked oligonucleotide; and 

[006Q] (c) detecting the presence of the target nucleic acid, wherein the sequence of the target 
nucleic acid is substantially complementary to the sequence of the oligonucleotide P*. 
[0061] The method of the first embodiment may further include before the detection step the 
step: (bl) extendhig the unblocked oligonucleotide using a nucleic acid polymerase to produce 
an extended oligonucleotide. The method may also include the addition of a second 
oligonucleotide which may or may not have a 3' non-extendible terminus, The second 
oligonucleotide may anneal to the substantially complementary strand of the target nucleic acid 
or it may anneal to the complement of the substantially conrplementarty strand of the target 
nucled acid. 
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[0062] In a second embodiineiit for detecting a nucleic acid the method involves the following 
steps: 

[0063] (a) adding to a nucleic acid containing sample two oligonucleotide P*s, wherein each 
oligonucleotide P* has a non-extendable 3' terminus, wherein the 3' terminal residue of each 
oligonucleotide P* is removable by pyropho^horolysis, wherein one oligonucleotide P* 
overlaps with the other oligonucleotide P* by at least one nucleotide at their respective 3' ends, 
and wherein one oligonucleotide P* anneals to a substantially complementary strand of the 
target nucleic acid present in (he sample and the other oligonucleotide P* anneals to a 
complement of the substantially complementary strand of the target nucleic acid; 
[(M)641 (fa) removing the 3' non-extendable terminus of the oHgonucleotide P*s annealed to the 
target nucleic acid by pyrophosphorolysis to unblock the oligonucleotide P*s to piodace 
unblocked oligonucleotides; and 

[00^5] (c) detecting titie presence of the target nucleic acid, whea-ein the sequence of the target 
nucleic acid is substantially complementary to Ihe sequence of the oligonucleotide V*s. 
[0066] The method of the second embodiment may further include before tiie detection step the 
step: (bl) extending the unblocked oligonucleotide using a nucleic acid polymerase to produce 
an extended oligonueleotide, 

[0067] In one embodiment, the defection of iJie micledc acid in step (c) is performed by detecting 
the unblocking of oligonucleotide P*. lo one aspect, the unbloddng is detected by loss of a 
label contained in the 3*^ tenninal residue of oligonucleotide P'''. In a second aspect, the 
unblocking is detected by detecting the presence of a 3' OH on tiie 3' tenninal residue tiiat is 
capable of extension or ligation. In this aspect, the detection is determined by extending ttie 
unblocked oligonucleotide or by ligatiug the unblocked oligonucleotide to an oHgonucleotide, 
hi a second embodiment, the detection of the nucleic acid in step (c) is performed by detecting 
the exteuded oligonucleotide. In one aspect, the extended oHgonucleotide is detected by the 
presence of a label in tibe extended oligonMcleotide. The label is part of a nucleotide or 
nucleotide analog used in the extension step. In a second aspect, the extended oligonucleotide is 
detected by gel electrophoresis. In a third aspect, the extended oligonucleotide is detected by the 
binding or incorporation of a dye or spectral material. 

[0068] The P* oligonucleotides are selected to be "substantially", complementary" to the 
different strands of each specific sequence to be amplified. Therefore, the P* oHgonucleotide 
sequence need not reflect the exact sequence of the template. For example, a non- 
complcanentary nucleotide segment may be attached to the 5'-end of the P* oligonucleotide, with 
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the remainder of the P* oligonucleotide sequence being complementary to the strand. 
Alternatively, non-complementary bases or longer sequences can be interspersed into the P* 
oHgonucleotide, provided that the P* oligonucleotide sequence has sufficient complementarily 
with the sequence of the strand to be amplified to hybridize therewith and form a template for 
synthesis of the extension product of the other P* oHgonucleotide. The abihty to detect 
nucleic acid sequences which are substantially complementary to ohgonucleotide P* is 
particularly useful for the detection of multiple mutations, such as seen m high viral load, where 
the detection of the presence of the virus is important and not necessarily the exact nucleic acid 
sequence of the virus. This method is also capable of detecting nucleic adds that are completely 
complementary. 

[0069] The present invention also includes other modifications of PAP. 

[0070] • The activatable oligonucleotide P* may contain blocked nucleotides at other positions 

in addition to the 3' termimis. 

[0071} • The introduction of interaal blocking nucleotides results in an interfece betweesn 
amplification and PAP which would pennit PAP to amplify in a nan-exponential manner (e.g., 
quadratic or geometric) with higher fidelity, ie., errors made by the polymerase would not be 
propagatable, 

[0072] • The activatable oligonucleotide P* may contain modified nucleotides that are 
extendible as well as the 3' blocked nucleotide. Thus, anywhere 5' to the 3' terminus, there may 
be cither blocking or non-blocking modified nucleotides. 

[0073] • A polymerase that pyiophosrphorolyzes the mismatched piimer rather tlian the matched 
primer could be used to detect rare mutatioiK im^rfxich the P* that mism^hed at Ihe 3' terminus 
is activated and extended. 

[0074] • The detection of a rare mutation is based on no mismatch aaywhere along the length of 
the oligonucleotide because a mismatch inhibits the activation of P*s. 

[0075] • Activation may occur by another mechanism, such as a 3' exonuclease. The 3* 
exonuclease may have specificity for the matched primer or the mismatched primer so that it 
discriminates as to whether there is a mismatch at the 3' end. The 3' exonuclease can be used 
either way. If it prefers a mismatch, it can be used as described above, but its ability to detect 
uncommon mutations would depend on some specificity for activation, although that specificity 
may come partly from internal mismatches. 



wo 03/095664 PeT/US03/14556 
13 

[00761 ' The "extension reaction can be performed by a DNA polymerase, an KNA polymesrase 
or a revesrse transcriptase, Ihe tempMe may be a DNA or an RNA, and Uie oligonucleotide P* 
may be a DNA, an RNA, or a DNA/RNA heteromer, 

{0077] • Pyrophosphorolysis and the extension can be performed by different polymerases. For 
example, the P* may include a penultimate modified oligonucleotide that could not be extended 
by pyrophosphorolyzing polymerase but could be extended by another polymerase. One 
example is a 3' dideoxy tiiat could be pyrophosphorolyzed by a DNA polymerase, but the 
presence of a ribonucleotide in the penultimate position would require extension by aa RNA 
polymerase. 

[0078] • PAP can be generalized as an inactive oligonucleotide that is activated by a nucleic 
acid metabohzing enzyme, such as helicases, topoisbmerases, telomerases, RNAH or restriction 
enzymes. 

[0079] • Methylases woidd detect the presence or absence of a me*hyl gtoiq) in genomic DNA. 
Methylases could be coupled with truncating amplification which farces the polymerase back to 
the template. 

[0080] • A P* in which the 3' end is a dideoxy and penultimate few nucleotides are ribos can be 
used as a tool for differentially making a protem product derived from a specific mutation that 
was desired, or for m a kin g a protein product: whose expression is linked to the presence of a 
particular sequence. PyrophDsphorolysis would activate the P* if there was a precise match to 
the mutation at the 3' end. The activated oligonucleotide is then a substrate for the generation of 
RNA by an RNA polymerase. The RNA could then be translated in vitro to produce the protein 
product. 

[0081] • PAP (PAP, Bi-PAP, matched or mismatched PAP, simplex PAP, multiplex PAP and 
othecs) can be used for quantificatiom The yield of the amplification products is quantitatively 
associated with the amount of iiq)ut template. The association may be proportional or otherwise. 
[0082] • In PAP, product may accumulate linearly, exponentially or otherwise. 

BRIEF DESCRIPTION OF THE FIGURES 

[0083] Figure 1 shows a schematic of the detection of a rare mutatica by allele-spedfic PAP 

(PAP-A). 

[0084] Figure 2 shows a schematic of bidirectional PAP-A (Bi-PAP-A). 

[0085] Figure 3 shows a schematic of PAP-based resequencing (PAP-R) performed on a 
microairay with programmable photochemical oligonucleotide. 



wo 03/095664 PCT/US03/14556 
14 

[0086] Figure 4 shows a sdieanatic of microarray-based "resequencing to detect a G to A 
mutation. 

[0087] Figure 5 shows a schematic of ligation-mediated PGR (LM-PCR), 
[0088] Figures 6A and 6B are a schematic illustrating use of PAP to detect tiie G allele at 
nucleotide 229 of Ihe Di dopamine receptor gene. The procwiure is described in detail in 
Example 1 below. 

J0089] Figure 6C is an autoradiogram of PAP from the G/G, A/A and G/A genotypes of the 
human dopamine receptor gene. 

[0090] Figures 7A and 7B are diagrams illustrating enhanced specificity of PAP relative to 
PASA. 

[0091] Figures 8A and 8B are autoradiograms showing the results of electrophoresis of samples 
obtained in Example 1 below. 

[00921 Figure 9 is an autoradiogram showing the results of electrophoresis of samples obtained 
in Example 1 below. 

[0093] Figure 10 is an autoradiogram showing the results of electrophoresis of samples obtained 
in ExaiDple 1 below. 

[0094] Figure 1 1 A is a schematic illustrating enhancement of PAP efficiency. 

[0095J Figure liB is an autoradiogram of PAP from the G/G, A/A and G/A genotypes of the 

hiiman dopamine recqjtor gene. 

[0096] Figures 12A-12E are autoradiograms showing the resulte of electrophor^s of samples 
obtained in Example 2 below. 

[0097] Figure 13 is an autoradiogram showing the results of electrophoresis of sanaples obtained 
inEx^ple2below. 

[0098] Figure 14 is am autoradiogram showing the results of electrophoresis of samples obtained 
in Example 2 below. 

[0099] Figure 15 is an autoradiogram showing the results of electrophoresis of samples obtained 
in Example 3 below, 

[0100] Figures 16A-16B show UV footprintifcog by LM-PAP. Fig. 16A shows allele-specific 
LM-PAP versus allele-specij&c LM-PCR for Ihe dopamine Dj receptor gene. Fig. 16B shofws 
LM-PAP for the pgk gene. 

[0101] Figures 17A-17B show PAP amplification directly from htmaan (Fig. 17A) and mouse 
(Fig. 17B) genomic DNA using PAP and Bi-PAP, respectively. 
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[0102] Figures 18A-18E show PAP ampHfication using 3' terminai acyclonucleotide blocked 
P*. Fig 18A; Model: A duplexed DNA template of the lad gene is shown. The mutated 
template contains a G at the nucleotide position 369, while the wild-type template contains a T 
at the nucleotide position 369 of the lad gene. P* = pyrophosphorolysis activatable 
oligonucleotide. The P* has an acycloNMP or a ddNMP at the 3' termmus. Hie P* is specific to 
tbie mutated template but mismatches to the wild-tj^je template at the 3' tenninus (Table 6). 0 = 
oligodeoxynucleotide. PAP was perfonned with P*l and 01, P*2 and 02, or P*l and P*2, 
respectively. Fig. 18B: PAP with 30 mer P*s: The P*s are specific for the mutated template but 
mismatch the wild-type template at their 3' termicus, hi lanes 1-8 are 3' tesmiDal 
acyclonucleotide blocked P*s. In lanes 9-16 are 3' terminal dideoxynucleotide blocked P*s for 
comcparison. In lanes 1-4 and. 9-12, the mutated template is used. In lanes 5-8 and 13-16, the 
wild-type template is used. The PAP product and P* are indicated with their sizes. Lane M is 
12QQg of (|>X174-PUC19/flaaZZZ'DNA marker. Fig. 18C: PAP with 35-iner P*s: The experiment 
is tiie same as in Fig. 18B except with 35-mer P*s that are 3' co-testminal with the 30-mer P*s 
and five nucleotides longer at their 5' termini. Fig. 18D: PAP wilh Vent (exo-) polymerase. The 
experimeatit is the same as in Fig. 18B exc^t that Vent (exo-) was used. Fig. 18E: PAP with 
(exO") polymerase. The eiq)eriment is the same as in Fig. 19B except that Pfii (exo-) was used. 
10103] Figure 19 shows ttiat PAP has high selectivity to detect rare mntations in the abundance 
of the wild-type template. In the example of nucleotide position 190, the mutation-specijBc P* 
matches the mutated A template but mismatches the wild-type T template at the 3' terminus. 
Specific and efficient amplification is indicated by thick arrows. When hybridized to the 
mutated A template, the P* cannot extend directly fborn the 3' terminal dideoxynucleotide, the 3' 
terminal ddTMP must be removed by pyrophosphorolysis and'the activated oligonucleotide is 
thcJi extended efficiently. Two types of nonspecific amplification Jxom the T template are 
indicated as Types I and 11. The nonspecific amplification occurs rarely when mismatch 
pyrophosphorolysis occurs to generate a wild-type product that will not support efficient 
amplification as template for subsequent cycles (Type I) (the error is radicated by thin arrow and 
estimated frequency of as low as 10"^). "When botti mismatch pyrophosphorolysis and 
misincorporation occur extremely rarely to generate a mutated product (Type H) (the errors are 
indicated by thin arrows and estimated coupled frequency of 3.3x 10'^^). Once the errors occur, 
the mutated product can be ampIiSed esfcponentially in subsequent cycles and so it determines 
the selectivity. 
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[0104] Figures 20A-20B show Bi-PAP ampKfication. Fig. 20A: Schematic of Bi-PAP to 
detection a rare mutation. The mutation-specific assay with two mutated. P* for nucleotide 190 is 
shown. Tlie downstream and upstream P*s contain a dideoxy T and a dideoxy A at their 3' 
termini, respectively. They ai-e specific for the T:A allele at nucleotide 190 (on the right), but aie 
mismatched to the A:T wild-type allele at their 3' termini (on the left). The P*s are 40 
nucleotides long and overlap at their 3' termini by one nucleotide. On the left, no substantial 
product is generated from the wild-type template because of the mismatch. On the rigjit, the 
mutated product is geaieErated efEcieaotly fiom the mutated template. Fig, 20B: Bi-PAP 
amplification directly from k DNA. Each of the wild-type and mutation-specific Bi-PAP assays 
at nucleotide 190 was used to amplify a 79-bp segment of the lad gene from k DMAs. For the 
wild-tj^e assay in lanes 1-3, the two wild-type P*s have 3' terroinal ddA and ddT, respectively. 
For the mutation-spedfic assay in laaes 4-6 and lanes 7-9, the two mutated P*s are witii ddT and 
ddA at their 3' termini, respectively. In lan^ 1, 4 and 7, 2000 copies of the wild-type template 
were added to each reaction. In lanes 2, 5 and 8, 2000 copies of the mutated template were 
added to each reaction. In knes 3, 6 and 9, no template was added. In knes 7-9, 200 ng of 
human genomic DNA was added as carrier. The product and P* are indicated. Lane M is 120ng 
of <t)X174-PUCl9/iJaein DNA marker. 

[0105J Figures 21A-21C show titration of template for sensitivity and selectivity of Bi-PAP. 
With the mutated P*s, the wild-type template was amplified to generate the mutated product in 
Experiment I. The mutated template was amplified to generate the mutated product in 
Experiments H, IE and IV. Fig. 21A: The mutation-spedfic Bi-PAP assay for A190T. In 
Experiment I, the copies of the wild-type X DNA are indicated in lanes 1-5. Lane 6 is a negative 
control with no DNA. In Experiment H, the copies of the mutated X DNA are indicated in lane 
7-11. Lane 1 1 (0.2 copy) is a negative control to support the dilution accuracy of copy number. 
Lane 12 is a negative control with no DNA. La Experiment HI, the copies of the mutated X DNA 
m thepreseaiceoflxlO* copies of the wild-type X DNA are indicated in lane 13-17. Lane 18 is a 
negative control with, only the wild-type X DNA. In Experiment IV, the copies of the mutated X 
DNA in the prepuce of 100 ng of human genomic DNA are indicated in lanes 19-23. Lane 24 is 
negative control only with the human genomic DNA. L^e "C WT" is the wild-t5fpe product 
control in which the wild-type P*s were used to amplify 2000 copies of the wild-type X DNA. 
Lane "C Mut" is the mutated product control in which the mutated P*s were used to amplify 
2000 copies of the mutated X DNA, The wild-type and mutated products with unique mobilities 
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are indicated. Fig. 21B: The mutation-specific Bi-PAP assay for T369G. Fig. 21C: The 
mutation-specific Bi-PAP assay for T369C. 

[0106] Figure 22 shows a design of inicroarray for Bi-PAP resequeucing. Bi-PAP can be 
used for resequenciug to detect raJf nown mutations iii a known region on a micaroarray. The P*s 
5 are designed according to the wild-type template. The two opposing P*s for each Bi-PAP are 
anchored in a microarray spot. Each pair of arrows represents four Bi-PAPs for one nucleotide 
position. A mutation is indicated on the template, and it spans six overlapped P*s. On the 
microarray, many Bi-PAPs can he processed in a parallel way. 

[01071 Figures 23A-23B show a schematic of Bi-PAP resequenciug. Fig, 23 A: Detection of the 

10 wild-type sequence. This is a close look at the microarray. The P*s are designed according to 
the wild-type sequence. On liie position of nucleotide A, four Bi-PAPs are synthesized wilii fbux 
pairs of P*s. The four downstream P*s have identical sequence, except that at the 3' terminus 
either ddAMP, ddTMP, ddGMP or ddCMP, corresponds to the wild-type sequence and the three 
possible single base substitutions. The four corresponding upstream P'^s have identical sequence, 

15 except that at the 3' tetminus either ddTMP, ddAMP, ddCMP or ddGMP. Each pair of P*s have 
one nucleotide overly at Qieir 3' termini. On the next nucleotide C, another four pairs of P*s are 
synthesized (not shown). If the wild-type sample is added, only the wild-type Bi-PAPs generates 
tlie specific product that is labeled by fluorescence. In this way, to scan a 1 kb region, you need 
8000 P*s. Fig. 23B: Detection of an A to T mutation. On the mutated nucleotide T, the 

20 mutation-specific Bi-PAP genesrates the mutated product. On the nesrt nucleotide G, no product 
of Bi-PAP is generated because each pair of P* contains one or two mismatches (not shown). 
[0108] Figures 24A-24B show Bi-PAP resequenciug microarray. Fig. 24A: Detection of the 
wild-type sequence. Four pairs of P*s are designed for each nucleotide position according to the 
wild-type sequence. Each pdr of P*s are downstream and upstream directed, and have oiie 

25 overlap and complementary nucleotide at Iheir 3' termini. The wild-type P* pair are specifically 
amplified on each nucleotide position. If all of the wild-type P* pairs specifically amplified, the 
wild-type sequence can be determined. Fig. 24B: Detection of the A to T mutation. With the 
mutated template, the mutation-specific Bi-PAP is amplified. There is a window of no Bi-PAP 
signals centered by the mutation-specific Bi-PAP and three successive nucleotides on each side. 

3 0 The paired specific subsequence is supposed to be seven nucleotides long. Any unknown single- 
base substitution can be determined, even if it is a heterozygous mutation. Also, small deletions 
and insertions can be detected and localized 
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[0109] Figure 25 shows PAP de novo sequencing on microaixay. PAP can also be used for de 
novo DNA sequencing of an unknown region. The paired specific subsequence is supposed to 
fifteen nucleotides long. P* pairs of a complete set of the paired specific subsequeace are on a 
microarmy with loiown addresses. After the unlcnown DNA sample is added, Bi-PAP is 
performed. AH the amplified B-PAP products are collected and thesa the paired specific 
subsequences of the amplified P* pairs are assembled by one-nucleotide overlying. Thus, the 
unknown complementary sequence is reconstracted. 

[0110] Figures 26A-26C show the detection of somatic mutations. Fig. 26A: Eighteen genomic 
DNA samples of the lact transgenic mice were chosen. 2 iig genomic DNA of each sample was 
ainplified with the assay B to detect the T369G imitation two times. Samples 1-10 are j&om 
livers of 25^month old mice. Samples 1 1-14 are j&am hearts (samples 11, 13 and 14) aod ad^ose 
(sample 12) of 6-mgtitii old mice. Samples 15-18 are firom brains of lO^day old mice. P = 
positive control that amplified the mutated X DNA, N = negative control with no DNA, + = 
amplified product, - = no product Fig. 26B: The assay B was performed In lanes 1 1-12, 13-16 
and 17-20, 2 p.g, 0.5 pg and Q.125 of the lact mouse geaomioDNA of sample 12 were used 
in each reaction, respectively. Lanes I-I Q are controls; the copy number of the mutated X DNA 
per reaction was reconstincted by two-fold serial dilutions. lu lanes 1-10 and 13-20, each 
reaction also contained 1 pg of the lacT mouse genomic DNA carrier, ss = siagle-straaded, ds = 
double-stranded. Fig. 2fiC: The assay B was performed. Jii lanes 11-14, 2 p.g of the lacf 
mouse genomic DNA of sample 3 was used in eaeti reaction. la lanes 15-18, 2 ]ig of the lacf 
mouse genomic DNA of sample 9 was used in each reaction. Lanes 1-10 are controls; the copy 
number of the mutated X DNA per reaction is mdicated. Each control reaction also contained 1 
y.g of the /flcT mouse genomic DNA carrier. 

DETAILED DESCRIPTION OF THE INVENTION 
[0111] The following temmology is used herein. 

[0112] Pyrophosphorolysis: removal of the 3' nucleotide from a nucleotide strand chain by 
DNA polymerase in the presence of pyrophosphate (PP;) to generate the nucleotide triphosphate. 
This is the reverse of the polymerization reaction. 

[0113] PAP: Pyrophosphorolysis activated polymerization. PAP can use one P* or can use two 

opposing ohgonucleotides in which at least one is P*. 

[0114] P*: an oligonucleotide with a non-extendible 3' terminus (or end) that is activatahle by 
pyrophosphorolysis. 



wo 03/095664 



PCT/US03/14556 



19 

[0115] PAP-A: PAP-based allele-specific ampHfication that can be used for detection of rare 
mutations (Fig. 1). 

[0116] Bi-PAP-A: PAP-Apexfomied with a pair of opposing P*, i.e., bidirectional (Fig. 2) with 
at least one nucleotide cvarl^ at their 3' termini. 
5 [0117] PAP-R: PAP-based resequencing for detection of unimown mutations within a known 
sequence (Figs. 3 and 4). 

[0118] LM-PAP: hgation-mediated PAP. The nature of LM-PAP is that the template is 

synthesized before PAP, such as by ligation reaction or by extension usiiig terminal transferase. 

[0119] LM-PCR: hgation-mediated POR. (Fig. 5). 
10 [0120] or alleles: alleles of the common polymorphism of the dopamine Di receptor gene 

fliat was used as a model system herein (also refcared to herein as G** or A" alleles). 

[0121] Linear PAP: PAP wifli only one P* for linear product accumulation. 

[0122] Exponential PAP: PAP with two opposing oligonucleotides for exponential product 

accutonlaiion, and at least one is P*. 
15 [0123] Noise rate (%): the relative yield of the mismatched product to the matched product. A 

specific signal for PAP is defined as a noise rate of less than 10%. 

[0124] PASA: PGR amplification of specific alleles (also known as allole-specific PCEi or 
ARMS). 

, [0125] Reseqnencing: scanning for unknown mutations and detemiining the precise sequence 
2 0 changes within a known sequence. Resequencing is distinguished fiom de novo sequencing, 
[0126] Mutation load: the jBrequency aod pattern of somatic mu tations within a tissue. 
[0127] Minimal residual disease: e.g., rare remaining cancer cells in lymph nodes and other 
neighboring tissues or early recurrence after remission. 

[0128] Non-extendible 3' terminus {or end): a nucleotide or nucleotide analog at the 3' 
25 terminus (or end) of oligonucleotide P* that is non-extendible but that is activatable by 
pyrophosphorolysis. Examples of a non-extendible 3' termini (or sids) include, but are not 
limited to, a 2'3'-dideox3Tiucleotide, an acyclonucleotide, 3'-deoxyadenosnie (cordycepin), 3'- 
azido-3'-deoxythyniidnie (AZT), 2',3'-dideoxyiaosine (ddl), 2',3'-dideoxy-3'-tMacytidine (3TC) 
and 2',3'-didehydro-2',3'-dideoxytIiymidine (d4T). 
30 [0129] SimpJex PAP: one PAP (PAP, Bi-PAP, matched or mismatched PAP, and others) in one 
reaction tube or on a solid support. 

[0130] Multiplex PAP: more than one PAP (PAP, Bi-PAP, matched or mismatched PAP, and 
others) in one reaction tube? or on a solid siqyport, e.g., micro array. 
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[01311 Matched PAP: PAP having a matcU between P* and its template. 

[0132] Mismatched PAP: PAP having a mismatch between P* aad its template. 

[0133] Nested PAP: PAP using two or more pairs of P* in which one pair is located inside a 

second pair on a template nucleic acid. 

[0134] Hotstart PAP: PAP ia which an rasential reaction component is withheld imtil 
denaturation temperatures are approached (Chaio et al., 1992; Kellogg et al, 1994; MuUis, 
1991; D'Aquila et al., 1991). Essential reaction components can be withheld by, e.g., a 
neutraliziiig antibody boimd to the polymerase, sequestering a component, such as flie polymers 
or MgCl2 in wax, chemically mo(fifying the polymerase to prevent activation undl high 
temperatiire incubation or separating components by wax. 

[0135] Truncated Amplification: an amplification method which amplifies non-exponemtially, 
e.g., in a quadratic or geometric mannear, with over two chimeric oligonucleotides and produces 
truncated temrinal products that are no more than, fliree rounds of replication from the original 
template. (Liu et al., 2002). 

[0136] Reactive '3 OH: is a 3' OH that is capable of being extended by a nucleic acid 
polymerase or ligated to an oligonucleotide. 

[0137] DNA polymerases, which are critical to nucleic acid aroplificajion, catalyze some or ail 
of the following reactions: i) polymerization of deoxynucleotide trqjhosphates or their analogs; 
ii) pyrophosphorolysis of duplexed DNA in the presence of pyrophosphate (PP), [dNMPJn + 
x[PPi] ...[dNMP]n-x + x[dNTP]; iii) 3'-5'exon[Uclease activity (which does not require PPi), and 
iv) 5-3' exonuclease activity (Duetcher and Kornberg, 1969; Komberg and Baker, 1992). For 
Taq and TJJ DNA polymerases, polymerization and 5'-3' exonuclease activity have been reported 
(Cliieu et al., 1976; Kaledin et al., 1981; Longley et al., 1990). For T7 Sequenase™ and Thermo 
Sequenase™ DNA polymerases, pyrophosphorolysis can lead to the degradation of specific 
dideoxynucleotidq- terminated segments in Sanger sequencing reaction (Tabor and Richardson, 
1990; Vander Horn et al., 1997). 

[0138] Pyrophosphorotysis is generally of veiy minor significance because PP; is degraded by 
pyrophosphatase under normal physiological conditions. However, in the presence of high in 
vitro coiicentrations of PP;, pyrophosphorolysis can be substantial. For oligonucleotides with a 3' 
tenniaal dideoxy nucleotide, only pyrophosphorolysis is possible. Qnoe the dideoxy nucleotide 
is removed, the activated ohgonucleotide can be extended by polymerization. 
[0139] Pyrophosphorolysis activated polymerization ^PAP) offers a novel approach for 
retrieving a diversity of information from nucleic acids. The exceptional specificity of PAP 
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derives from the serial coupling of two reactions. PAP involves the activation by 
pyrophosphorolysis of a 3' terminal blocked oligonucleotide (P*) followed by extension of the 
activated oligonucleotide by DNA polymerization. Operationally, PAP involves the use of an 
activatable oligonucleotide (P*) in place of a normal oligonucleotide that can be directly 
extended. Examples of P* include an inactive dideoxy terminated oligonucleotide P* or an 
inactive chemically modified nucleotide lacking a 3' hydroxyl group, such as an 
acyclonucleotide, or having a non-extendible nucleotide teaminated oligonucleotide P*. 
Acycloclonucleotides (acycloNTPs) in which the sugar ring is absent are known to act as chain 
terminators in DNA sequencing (Saager et al., 1977; Trainer, 1996; Gardner and Jack, 2002). 
The activation of P* is inhibited by mismatches throughout the length of the oligorrucleotide. 
Mismatches evesn two nucleotides from the 5' terminus inhibit PAP amplification. 
[0140] Activation of aP* by pyrophosphorolysis ofiers extraoititnary specificity throughout the 
leaagth of P*. The enhanced specificity can be used to detect rare known mutations, to elucidate 
unknown mutations by resequeacing; to determine unknown sequence by de novo sequeticiiig, to 
measure gene expression levels, to compare two sequences, and to increase the specificity of m 
vivo analysis of chromatin structure. Microarray-based programmable photochemical 
oligoiiucleotide synthesis and PAP are synei^istic technologies. Thus, tiie enhanced specificity 
can be used for rapid, microarray-based resequencing, de novo sequencing, gene expression 
pirofiling and SNP detection. 

10141] A number of methods for enzymatic nucleic acid an^lification in vitro have been 
developed and can be adapted to detect known sequence variants. These include polymerase 
diain reaction (PGR) (Saiki et al., 1985; Saiki et al., 1988), ligase chain reaction (LCR) 
(Landegren, 1998; Barany, 1991) and rolling circle amplification (RCA) (Baner et al., 1998; 
Lizardi et al., 1998). Herein, we describe pyrophosphorolysis activated polymerization (PAP), 
an approach that has the potential to enhance dramatically the specificity of PGR allele-specific 
amplification (Sommer et al., 1989). PAP differs from corrections with PGR in multiple ways: i) 
the P* ongonucleotide is blocked at the 3' terminus and must be activated by 
pyrophosphorolysis, ii) pyrophosphorolysis and polymerization are serially coupled for each 
amplification, iii) PAP may be perforra.ed with one P* for linear amplification or with two 
oligonucleotides for e^onential amplification, iv) PPj is necessary for the amplification, v) 
significant nonspecific aiuplification would require the serial coiq)Iiag of errors of both 
mismatch pyrophosphorolysis and misincorppration. 
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[0142] The enhanced specificity of PAP relative to PASA is provided by serially coupHng 
pyrophosphorolysis and polymerization. Significant nonspecific amplification requires 
mismatch pyrophosphorolysis and misincorporation by DNA polymerase, an extremely rare 
event For example as d^cribed herein, DNA polymerase was utilized to detect the G allele at 
nucleotide 229 of the D] dopamine receptor gene. P* was synthesized eitlier with ddA, ddT, ddG 
or ddC at the 3' terminus. The 3' taminal dideoxynucleotide inhibits direct extension by 
polymerization, but can be removed by pyrophosphorolysis in the presence of pyrophosphate 
(FPi) when the P* is specifically hybridized with the complemetrtary strand of ttle G allele. The 
activated oligonucleotide can be extended by polymerization in the 5'-3' direction. 
[0143] Evidence is presented that pyrophosphorolysis followed by polymerization can be used 
to increase the specificity of PASA Sigdficant nonspecific amplification wifli PAP requires tte 
serial coupling of the two types of errors, i.e., mismatched pyrophosphorolysis and 
misincorporation (Fig. 1). Hie rate of mismatched pyrophosphorolysis is expressed as the 
relative rates of removal of a 3' mismatch deoxynucleotide relative to the correct 3' 
deoxynucleotide. The rate of mismatch pyrophosphorolysis is less than 10'^ for T7 DNA 
polymerase (Komberg and Baker, 1992; Wong et al., 1991). The misincorporation rate to cr^te 
a substitution mutation by polymerization, expressed- as the incorporation rate of an incon-ect 
versus a correct dNMP, was reported to be 10"* for T7 DNA polymerase and to be 10"^ for E. 
coll DNA polymerase I (Komberg and Baker, 1992; Wong et al., 1991; Bebenek et al., 1990). 
Similar results were reported for Tag DNA polymerase, 3 '-5' exonuclease-deficient mutants of 
T7 DNA polymerase and E. coli DNA polymerase I (Komberg and Baker, 1992; Wong et al., 
1991; Bebenek et al., 1990; Eckert andKunkel, 1990). 

[0144] PAP is a method of synthesizing a desired nucleic acid strand on a nucleotide acid 
template strand. In PAP, pyrophosphorolysis and polymerization are serially coupled for nucleic 
acid amplification using pyrophosphorolysis activatable oligonucleotides (P*). P* is an 
oligonucleotide that is composed of N nucleotides or their dialogs and has a non-extendible 
nucleotide or its analog at the 3' terminus, such as 3 ',5' dideoxynucleotide. When substantially 
hybridized on its template strand, P* could not be extended dhectly jfrom the 3' tenninal 
nucleotide or its analog by DNA polymerase, the 3' terminal nucleotide or its analog of the P* 
can be removed by pyrorphosphorolysis and them the activated oligonucleotide (< N) can be 
extended on the template. 

[0145] The method comprises the following steps oairied out serially. 



wo 03/095664 



PCT/DS03/14556 



23 

[0146] Annealing to the template strand a substantially complementary activatable 
oligonucleotide P*. This activatable oKgonucleotide P* has a non-ejctendible nucleotide or its 
analog at the 3' terminus. 

[0147] (b) Pyrophosphorolyzing the annealed activatable oligonucleotide P* with 
pyrophosphate and an aazyme that has pyrophosphorolysis activity. This activates 
oligonucleotide P* by removal of the 3' terminal non- extendible nucleotide or its analog. 
[0148] (c) Polymerizing by extending the activated oligonucleotide P* on the template strand in 
the presence of nucleoside triphosphates or their analogs and a nucleic acid polymerase to 
synthesize the desired nucleic acid strand. 

(0149] The PAP method can be applied to amplify a desired nucleic acid strand by the following 
additional steps. 

[0150] (d) Separating the desired nucleic acid strftnd of step (C) from the template strand, and 
10151] (e) Repeating steps (A)-(I)) until a desired level of aratplification of the desired nucleic 
acid strand is achieved. 

[0152] The above PAP method can be ^lied to allele-spedfic aniplification. The activatable 
oligonucleotide P* has c?ne or more nucleotides tliat are not conipiemeatary to the tesmplate 
strand. Theunoomplimentary nucleotide(s) of P* may locate atthe3' termimis of P"^. The above 
step of (A), (B) or (C) could not occur substantially. As the result, the desired nucleic acid sti^d 
is synthesized substantially less. 

[0153] The above PAP mefliod can be applied with only one activatable oligonucleotide P*. (e) 
Repeating steps (a)-(d), a desired level of amplification of the desired nucleic acid strand may be 
achieved linearly. The targeted nucleic acid region outside the. annealing region may be of 
different sizes or of different sequence contexts, so the synfh^zed nucleic acid strands are of 

different sizes or of different sequence contexts. 

[0154] The above PAP method can be ^plied with two opposing oligonucleotides of which at 
least one is the activatable oligonucleotide P*. The activatable oligonucleotide P* and the 
second oligonucleotide are targeted for amplification of a nucleic acid region. Steps (a)-(c) occur 
to the activatable oligonucleotide P*. The second oligonucleotide is substantially 
complementary to the other template strand. If the second oligonucleotide is another activatable 
oligonucleotide P*, steps (a)-(c) occur. If the second oligonucleotide is a regular extendible 
oligonucleotide, steps (a) and (c) occur: (modified a) amiealing to its template strand, followed 
by (modified c) polymerizing by extending the oligonucleotide on its template strand in the 
presence of nucleoside triphosphates or tiieir analogs and a nucleic acid polymCTSse to 
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synthesize the desired nucleic acid strand, (e) Repeating steps (a>(d), or steps (a), (c) and (d), a 
desired level of amplification of the desired nucleic acid strand may be acMeve4 e.g., 
exponentially. The targeted nucleic acid region between the two annealing regions of the two 
opposing oligonucleotides raay be of different sizes or of difterent sequence contexts, so the 
synthesized nucleic acid strands are of different sizes or of different sequence contexts, 
[0155] LM-PAP involves cleavage, primer extension, Mcer ligation and PAP that can be 
applied for analysis of in vivo chromatin structure, such as, methylated state of chromosomes. 
t0156] LM-PAP may be performed by steps (i), (ii), (iii), (iv) and (v), by steps (i), (ii), (Hi) and 
(vi), by steps (ii), (iii), (iv) and (v) or by steps (ii), (iii) and (vi), where Ihe steps are as Mk^m. 
[0157] The cleavage occurs chemically, enzymaticaily or naturally to "breakdown" nucleic acid 
strands. The nucleic acid usually is genomic DNA that may have lesions or nicks produced in 
vivo. 

[0158] (ii) The primer of PI is gene-specific and its extension incliuies; 1) annealing 1o the 
template straaad a substantially complementaiy primer; 2) extending the primer on the template 
strmid in the presence of nucleoside triphosphates or their analogs and a nucleic acid 
polymerase, the extension "runs off" at ihe cleavage site on the tonpkte strand. Steps 1) and 2) 
may be repeated. 

[0159J The primer extension may be replaced by a P* extension (The above PAP mth only one 
activatable oligonucleotide P*). 

[0160] (iii) The linker ligation step includes ligation of a iinlcer to the 3' tetminus of the 
synthesized nucleic acid strand. The linker ligation step may be replaced by a teiminal 
transferase extension that is non-template dependent polymerization and an extra nucleic acid 
sequaice is added to the 3' terminus of the synthesized nucleic acid strand. 
[0161] (iv) PGR. is performed with a second gene-specific primer (P2) together with a primer 
specific for the linker or the added sequence by terminal transferase. 

[0162] (v) A third gene-specific P* (P^) is used to detect the PGR generated firagments. PAP 
method is apphed with only one activatable oligonucleotide P*. The extension of the activated 
oligonucleotide P* "runs off at the end of the template strand generated in step (iv). The PAP 
method may be applied in allele-specific manners. The activatable oligonucleotide P* may 
contain one or more nucleotides that are not complementary to the template strand. The 
uncomphmentary nucleotide(s) of P* may locate at the 3' tennmus of P*. 
[0163] (vi) Instead of steps (iv) and (v), PAP method can be applied with two opposing 
oligonucleotides of which at least one is the activatable oligonucleotide P*. The activatable 
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oligonucleotide P*CP3) is gene-specific. The second oligonucleotide is specific for the linker or 
the added sequence by tenninal transferase. The second oligonucleotide may be another 
activatable oHgonucleotide P* or a regular primer. The PAP method may be applied in allele- 
specific maimers. The activatable oligonucleotide P* (F3) may contain one or more nucleotides 
that are not complementary to the template strand. The uncomplimentary nucleotide(s) of P* 
may locate at the 3' terminus of P* (P3). 

[0164] Fig. 1 shows detection of a rare mutation by allele-specific PAP (PAP-A). PAP-A can 
detect a rare allele with extremely high specificity because an allele-specific oligonucleotide 
with a 3' dideoxy tennimxs (P*) permits the serial coupling of pyrophosphorolysis and 
polymerization. For example, if an allele-specific oligonucleotide has a 3' dideoxy tenninus 
(P*) that matches a rare 'T" allele, activation occurs by pyrophospharolytic removal of the 
dideoxy nucleotide and is followed by polymerization (Situation A). Activation by 
pyrophosphoiolysis does not normally occur with a mismatch at the 3"tenninus as with the wild- 
type "C" allele (Situation B). Rarely, pyrophosphorolysis does occur at a mismatch (estimated 
frequency 10"^), but tiie adivated oligonucleotide is extended to produce wild-type sequence 
(Situation C), A product that siq)ports efficient amplification is generated when mismatch 
pyrophosphorolysis occurs, a polymerase error that inserts A opposite C in template DNA 
(Situation D). The fi:equency of mismatch pyrophosphorolysis coupled with the polymeErase 
mutation is estimated at 10^x3x10"^ = 3x10'". 

[016SI PAP has a potential specificity of 3x10"". Approaching this potential requires a design 
that eliminates confounding sources of enor. For exsmple, extension cirorB from non-blocked 
tq)stream ohgonucleotides can generate a product with ttie mutation of interest. If the 
misincorporation rate for TaqFS is about 10"^ per nucleotide and only one of the three 
misincorporattons generates the mutation of interest, the error rate is about 3.3x10'^. Polymerases 
that contaui a proofiieading function mi^t have an error rate per specific mutation of 3x10"''. 
Polymerases or polymerase complexes with lower error rates would improve specificity furflier. 
[0166] One approach utilizes linear PAP. Linear PAP-A may be performed for 40 cycles with 
only P* in the presence of a fluorescent or radiolabeled ddNTP. A labeled terminated product of 
defined size will be generated when P* is activated. Linear PAP-A has the advantage of utihzing 
only the original genomic DNA and eliminating error due to misincorporation from extension of 
an unblocked upstream primer. However, the sensitivity of detection is limited because the level 
of amplification is not greater than the number of cycles. For a simple genome like lambda 
phage, a detection specificity of 10"^ is possible. The specificity of linear PAP-A depends 
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critically on the absence of unblocked, extendible oligonucleotides. To achieve a robust 
specificity of 10"^, unblocked extendible oligonucleotides should be present at 10"'. This maybe 
achieved by treating gel purified P* (about 99.99% pure with our present protocol) with a 3' to 5' 
exonuclease to degrade unblocked molecules followed by repurification by gel electrophoresis. 
[0167] A second approach is bidirectional PAP-A (Bi-PAP-A; Fig. 2). In Bi-PAP-A, both the 
downstream and upstream oligonucleotides are P*s that are specific fer the nucleotide of 
interest. The P*s overlap at their 3' termini by one nucleotide. This design eUrmnates extension 
error fiom a non-blocked iqjstream oligoraicleotide. This design should not be limited by small 
amounts of active contaiiiiaatirig oKgonucleotide to which the dideoxy tenninus has not been 
added (about 0.01% with our current protocol) becajise the product generated will be tiiat of a 
control and wiU not be a substrate for ef&deirt amplification in subsequent cycles. 
[01^] Bi-PAP-A generates a product that is the size of a primer dimer. However, it is not a 
primer dimer in the conventional sense, in that template DNA with a mutation of interest is an 
inteimediate required to generate a product that is an efScient substrate for amplification in 
subsequent cytsles. Bidirectional PAP-A eliminates important bottlenecks to specificity and has 
the potential to reach a specificily of 1 0"^ 

[0169] As shown in Fig. 2, bolii the downstream and the upstream P*s are specific for the 
nucleotide of iaterest at the 3' tamini (an A:T base pair in tMs example). In the initial round of 
amplification fiom genomic DNA, segments of undefined size will be generated. In subsequent 
rounds, a segment equal to the combined lengths of the oligonucleotide minus one will be 
ampfified exponentially. Nonspecific amplification occurs at lower frequencies because this 
design eliminates misincorporation error ft-om an unblocked upstream oligonucleotide that can 
generate the A:T template from a G;C wild-type template with an error rate of 3x10"'". The P*s 
may be 30-60 nucleotides for most efficient amplification. Situation A shows that a template 
with a rare A:T allele will be amplified efficiently. Both the upstream and the downstream P*s 
are ampfified efficiently. Situation B shows that if the DNA template contains the wild-type 
G:C sequence, neither the downstream nor the upstream P* will be activated substantially. 
[0170] Rapid resequenciQg will faciHtate elucidation of genes that predispose to cancer and 
other complex diseases. The specificity of PAP lends itself to resequencing; P*s may be 
photochemically synthesized on microairays using flexible digital micramirror arrays. 
[0171] Microarrays of immobihzed DNA or oligonucleotides can be fabricated dlJier by in situ 
light-directed combinational synthesis or by conventional synthesis (reviewed by Ramsay, 1998; 
Marshall and Hodgson, 1998)- Massively parallel analysis can be perforoied. Photochemical 
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synfliesis of oligonucleotides is a powerM means for combi[mtorial parallel synthesis of 
addressable oligonucleotide microarrays (Singh-Gasson et al., 1999; LeProust et al., 2000). Tiiis 
flexible alternative to a large number of photolithographic masks for each chip utilizes a 
maskless array synthesizer with virtual masks generated on a computer. These virtual masks are 
5 relayed to a digital micromkror array. An ultraviolet image of the vktual mask is produced on 
the active surfece of die glass substrate by a 1:1 reflective imaging system. The glass substrate is 
mounted in a flow cell reaction chamber connected to a DNA synthesizer. Cycles of 
programmed chemical couphng occur after light exposure. By repeating tlie procedure with 
additional virtual masks, it is possible to synthesize oHgomucleotide microarmys with any 

10 desired sequence. The prototype developed by Singh-Gasson, et al. synthesized oligonucleotide 
miaroarrays containing more than 76,000 features measuring 16 square microns. 
[0172] By oombioing programmable photochsnical oligonucleotide synthesis with digital 
minors and oUgpnueleotide extension of P*, a high throughput and automated rhethod of 
resequencing is possible. PAP-R may detect virtually 100% of smgle base substitutions and 

15 other small sequence variants because of its high lednndancy; the mismatch spanned by the 
several overlapping P* oligonucleotides will prevecit activation of a cluster of overlapping P*s. 
One strategy for resequmcing is shown in Figs. 3 and 4. Fig. 3 shows a schematic of PAP-R 
performed on a miCTbanray with programmable photochemical oligonucleotide: PAP can be used 
for resequencing to detect unknown mutations. On this microarray, the wild-type template is 

20 indicated. The P*s are desigaed according to the wild-type template. The P*s that overlap with 
the mutation geaierate little of no signal indicated as "Low" PAP signal. 
[0173] Fig. 4 shows an example of sohd support-based, e.g., microarray-based, resequencing to 
detect a G to A mutation. Lmear PAP is performed with four different dye-labeled ddNTPs as 
substrates for single-base extensions. P*s have a specific region of 16 nucleotides withm the 3' 

25 region of the ohgonucleotide. Homozygous or hemizygous DNA template is utilized in the 
txample. Sets of four P*s, with identical sequence except for the four ddNMPs at the 3' 
terminus, are synthesized for each nucleotide position on the sense sixand of the \vild-type 
sequence. The P* with a ddA at the 3' termmus generates a PAP signal at llie site of the G-A 
mutation. The mutation also creates a 15 base "gap" of no PAP signal for the subsequent 

30 overlapping 15 sets of P*s. For heterozygous mutation, the P*s with ddA and ddG provide PAP 
signals. The heterozygous mutation also generates the 15-b^e "gap" of 50% signal intensity 
(which is flanlced by signals of 100% intensity). For added redundancy with heterozygotes 
samples, autisense P*s can be utiUzed (not shown). An unknown single-base substitution can be 
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detennined by combiaation of the two sets of P*s. Small deletions and iBsertions can be detected 
and localized. 

[0174] With 100,000 ohgonucleotides per microarray, about 12 kb can be resequeaced from 
downstream ani upstream directions. The detection of virtually all mutations requires 
supplementation of the standard Geniom® instrument softw^are. For wild-type sequence, the 
signal intensities may vary. Certain oligonucleotides will generate a weaker signal due to 
secondary structure and other factors. The pattern of signal firom wild-type samples should be 
distinguished rehably from the pattern generated by a given sequence change. The preliminary 
data suggest that almost all niismatches wiU inhibit activation dramatically. Because of the 
redundancy, mutations may be reliably distinguished fom the wild-type even if a sigrdficaot 
minority of single base mismatches does not inhibit activation substantially. 
{01751 It is becoming increasingly apparent that in vivo chromatin structure is crucial for 
mamnaalian gene regulation and development. Stable changes in chiomatin structure often 
involve changes in methylation and/or cihanges in histone acetylation. Somatically heritable 
changes in dnomatin structure are commonly called epigenetic changes (Russo and Riggs, 
1996) and it is now clear that epigeaetic "mistakes" or epimujations are frequently an important 
contributmg factor to the developmrait of cancer (Jones and Laird, 1999). 
[0176] One of Uie few methods for assaymg in vivo chromatin structure, and the only method 
with resolution at the single nucleotide level, is ligation-mediated PGR (LM-PCR) (Mueller and 
Wold, 1989; Pfeifer et al., 1989). LM-PCR has beeai used, to assess chromatin structure, 
' methylation aud damaged DNA. Fig. 5 shows a schematic of LM-PCR in which a DNA lesion 
in liie starting DNA is indicated by a small diamond. LM-PCR involves cleavage, primer 
extension, linker Hgation and PGR amplification. LM-PAP is similar to LM-PCR except that 
activatable oligonucleotide P*s are used. 

[01771 LM-PCR has proven to be an important technique, now having been used in over 100 
published studies (Pfeifer et al., 1999). Many aspects of chromatin structure can be determined 
by LM-PCR, such as the location of methylated cytosine residues, bound transcription factors, 
or positioned nucleosomcs. Importantly, the structure is determined in cells that are intact and 
have been minimally perturbed. UV photo-footprinting, for example, is performed by UV 
irradiating tissue culture cells in a Petri dish, immediately exfaiacting the DNA, and performing 
LM-PCR to determine the location of thymidine dimers, the formation of which is affected by 
bound transcription factors. 
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[0178] Allele-specific LM-PAP can be appHed to quantitatively detennine the level of vivo 
methylaiion. The background of LM-PCR currently limits reliable estimation of the level of 
methylatioa It is generally considered that 0 %, 50 % and 100% methylation can be determined, 
but distinguishing finer gradations is not reliable. With a marked reduction in background in 
LM-PAP, 0%, 20%, 40%, 60%, 80%, and 100% methylation standards may be distinguished 
reliably. It will be of particnlar interest to utilize allele-specific LM-PAP to examine the level of 
methylation in imprinted regions, or in active verses inactive X-chromosomal genes in females. 
It is anticipated that LM-PAP will decrease the skill and experience needed to exanaine 
chromatin stiiicture, thereby facihtating analysis of chromatin structure by more laboratories. 
[0179] LM-PAP has a diversity of applications. It will be of particular interest to utilize allele- 
specific PAP to examine difFereaiitial methylation and clironiatin structure in imprinted genes or 
in active versus iaactive X chromosomal genes in females. In addition, the relationshq) betweeaa 
mutagens, DNA damage^ and mutagenesis can be examined. 

[0180] In PAPj as described above and illustrated herein, pytpphosphorolysis and 
polymerization by DNA polym^ase are coupled serially by using pyrophosphorolysis 
acUvatable oligonucleotide. In PAP sequencing, the principle is based on flie specificity of PAP 
and in turn on the base pairing specificity of the 3' specific subsequence. This property of the 3' 
specific subsequence can be applied to scan for unknown sequence variants, to detennine de 
novo DNA sequence, 1o compare two DNA sequences and to monitor gene esxpression profiling. 
[0181] PAP is highly sensitive to mismatches along the length of P* in PAP with one P* and 
one opposing unclocked oligonucleotide. The specificity of PAP is also affected by P* length 
and mismatch. If the aUele-specific nucleotide of P* is at the 3' terminus, only the specific allele 
is amplified and lie specificity is not associated with P* length. If the allele-specific nucleotide 
is not at the 3' tamoimis of P*, the specificity is associated with P* length. 26 mer P* has a 3' 
specific subsequence of three-nucleotides within this region any mKmatch inhibit the 
amplification. 18-mer has a 3' specific subsequence of 16 nucleotides. 

[0182] Bi-PAP is a form of PAI\ In Bi-PAP with two opposing P^s, each P* has its own 3' 
subsequence, i.e., within this region any mismatch inhibit the atnphfication of Bi-PAP. For 
example, when the allele-specific nucleotide of the P* pair is at their 3' termini, only the specific 
allele was ampHfied, no matter what the lengths of the P*s are 40, 35 or 30 nucleotides. The 
length of the paired specific subsequence is addition of the P* pair mitius one. 
[0183] The length of the paired specific subsequeace may be affected by the sequence context 
and size of each P*, the type of the 3' terminal non-extendible nucleotide, the template sequence, 
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the DNA polionerase, otiier components like ions, and cycling conditions. When the template 
contains repeated sequences or homogenous polymer runs longer tiian the length of the P* pair, 
P* may lose specificily for anchoring. 

[0184J Resequencing is the sequencing of a laiown region to detect msknown mutations. The 
5 property of the paired specific subsequence of Bi-PAP can be applied to scamting for unknown 
sequence variants or re-sequencing of predetermined sequences in a parallel way. 
[0185] A Bi~PAP resequendng is shown in Figs. 22, 23A, 23B, 24A and 23B. Briefly, the wild- 
type sequence can be determined, and any single base substitution can be determined with the 
type and position. An unknown small deletion and insertion can be detected and localized- In 
10 order to identify a specific type of deletion or insertion, it is possible to add corresponding Bi- 
PAPs. For fingerprinting, which can provide information regarding noutation position, fewer 
uianbers of Bi-PAPs can be used. 

[0186] The concq)t of Bi-PAP dte novo DNA sequencing makes use of the complete set of 
paired specific subsequence of the P* pair to identify the pussence of the paired specific 

IS subsequence in the de novo sequence. 

[0187] Bi-PAP de novo DNA sequencing on microatxay is shoAvn in Fig. 25. BriejEly, the 
procedure first collects all the Bi-PAP ampIiEcations with tiieir P* pairs and then reconstructs 
the miknown DNA sequence fiom this collection by ordering the paired specific subsequences. 
[018S] For comparison of two DNA sequences to see if they are the same or different, there is a 

20 simple way to reduce the number of P* pairs by using m incomplete set of the specific 
subsequences of the P* pair. By arranging them in a particular order, it is possible to identify 
the chromosomal locations as well as the sequences. 

[0189] To monitor gene expression profiling, where up to 6x10* to 10^ transcripts are expressed 
and details of the precise sequence are lumecessaiy, Bi-PAP can be applied. A set of P* pairs 

2 5 which can specifically amplify unique motifs in genes can be designed for Bi-PAP, 

[0190] This property of the base pairing specificity of Bi-PAP can be applied to scan for 
unknown sequence variants, to determine de novo DNA sequence, to compare two DNA 
sequences and to monitor gene expression profiling. A Bi-PAP array is possible. Each pair of 
two opposing P*s can be immobilized at an individual spot on a solid support, e.g., microarray, 

3 0 thus allowing all the Bi-PAP reactions to be processed in parallel. 

[0191] For PAP, the activatable ohgonucleotide has a non-essteaidible 3' terminus that is 
aotivatable by pyrophosphorolysis (hereinafter referred to as a non-extendible 3' temiinus). Any 
3' terminal non-extendible oligonucleotide can be used, if it can hybridize wiih the ten^late 
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steand, the 3' terminal nucleotide can be removed by pyropbosphorolysis, and the activated 
oligonucleotide can be extended. Examples of non-extendible 3' terminus include, but are not 
limited to, a non-extendible 3' deoxynucleotide, such as a dideoxynucleotide, or a chemically 
modified nucleotide lacking the 3' hydroxyl group, such as an acyclonucleotide. 
Acyclonucleotides substitute a 2-hydroxyethoxymetiiyl group for the 2'-deoxyiibofliraiiosyI 
sugar normally present in dNMPs. 

[0192] Altemative blocking agents may increase the selectivity of pyrophosphoroloysis for a 
complete match, thereby further enhancing the selectivity of PAP for detecting rare mutations. 
Finally, alternative blocking ageaats may be less expensive or more readily automatable, thereby 
improving the cost-effectiveness of PAP and fedlitating PAP microarray-based reseqiiencing. 
[0193] In addition, P*s not blocked wiHi dideoxynucleotides extends the selection of DNA 
polymerases which can be used for PAP, As demonstrated herein. Family I polymerases may be 
used for PAP when the 3' temiinal non-extendible oligonucleotide contains a dideoxynucleotide 
or an acyclonucleotide. Family II polymerases may be used for PAP when (he 3' tenninal non- 
extendible ohgonucleotide contains an acyclonucleotide. 

EXANdPLES 

[0194] The invention can be understood from the following Examples, wMch illustrate that PAP 
can be used to identify a known mutation in a polymorphic site within the human Di dopamine 
receptor gene. The effects of the dideoxyoHgonucleotide sequences, acyclonucleotide 
sequences, DNA polymerases, PPj concentrations, allele-specific templates, pH, and dNTP 
concentrations were examined. The experiments reported in the Examples were conducted for 
proof of principle. The following examples are offered by way of iliustration and are not 
intended to limit the invention in any maimer. Standard techniques well known in the art or the 
techni(jies specifically described therein were utilized. 

EXAMPLE 1 
Preparatioii of template by PGR 
[0195] A 640-bp region of the human Di dopamine receptor gene was amplified by PGR with 
two primers (T = 5' GAG CTG GAG GAA GGG AGT GAG AAG 3' (SEQ E) N0:1) and U = 5' 
TCA TAG GGG AAA GGG GTG GAG ATA 3' (SEQ ID N0:2)) (Fig. 6A). The TUOT 
duplexed product spans nucleotides 33 to 672 in GenBanlc X55760 and the G+C content is 
55.3%. A common A to G polymorphism is located at nucleotide 229, resulting in three 
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genotypes of G/G, A/A and G/A (Liu et al., 1995). The PGR mixture contains a volume of 50 
fil: 50 mM KCl, lOmM Tris/HCl, pH 8.3, 1.5 doM MgClz, 200 each of the four dNTPs 
(Boehringer Mannheim), 0.1 liM of each primer, 2% DMSO, 1 U of Tag DNA polymerase 
(Boehringer Mannheim) and 250 ng of genomic DNA from G/G homozygote, AJA homozygote 
or G/A heterozygotes. Cycling conditions included: denaturation at 95 °C for 15 seconds, 
annealing at 55 "C for 30 seconds, and elongation at 72°C for one minute, for a total of 35 cycles 
(Perkin-Elmer GeneAmp PGR system 9600). The PGR product was purified &om primers and 
other small molecules by approximately 10,000-fold by three times of retention on a Centricon® 
100 microconcentrator (Amicon). The amount of recovered PGR product was determined by 
XTV absorbance at 260 nm. 

Synthesis of P* by adding a 3'-dideoxynucleotide 
[0196] The deoxynucleotide oligonucleotide was synthesized by Perseptive Biosystems 8909 
Synthesizer CFraminsham) and purified by oligopure cartridges (Hamilton) in the Gty of Hope 
DNA/RNA Chemistry Laboratory. The 3' terminal dideoxynucleotide was added by terminal 
transferase. The mixture oontamed a total volume of 40 200 mM potassium cacodylate, 25 
mM Tris/HCl (pH 6.6 at 25 °C), 2.5 mM C0CI2, 0.25 mg/ml of BSA, 4000 pM of the 
oUgonucleotide, 2.5mM 2'3'-ddNTP (the molar ratio of the 3'-OH termmus to ddNTP was 1:25) 
Bcehringer Mfflinheim), 125 U oftermmal transferase (Boehringer Mannheim). The reaction 
was incubated at 37°C for 1 hour and theaa stopped by addmg EDTA at 5 mM final 
concentration. After desalting by using butanol, the dideoxyoHgonucleotide was purified by 
preparative 7M urea/20% polyacrylamide gel electrophoreais in TEE buffer {90 mM Tris/borate, 
ImM BDTA, pH 8.3) (Maniatis et al., 1982). The amount of the recovered P* was detemimed 
by UV absorbance at 260 nm. 

[0197] Smce small amounts of untermmated oligonucleotide would result in non-specificity of 
pyrophosphorolysis, each dideoxyohgonucleotide was ^^P-labeled at the 5' temiinus by T4 
polynucleotide kmase and then was electrophoresed through a 7M urea/20% polyacrylamide gel. 
Only P* products were visible even when the gel was overexposed (data not shown). It is 
estimated that more than 99.99% of P* contained a dideoxynucleotide at the 3' terminus. 

Pyrophosphorolysis activated polymerization 
[0198] A 469-bp region within the TU:UT diq)lexed template was amphfied by PAP with 
ohgonucleotides P* and U, or with only one P* (Table 1 and Fig. 6A). The PU:UP duplexed 
product coiresponds to nucleotides 204 to 672 m GtsnBank X55760 and the G+C conlent is 
55.6%. Unless stated, the PAP reaction mixture contained a total volume of 25 pi for 2)2 DNA 
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polymerase; 75 mM KCl, 20 mM Tris/HCl (pH 7.4), 1.5 mM MgCk, 40 ^iM each of the four 
DNTPs (dATP, dTTP, dGTP and dCTP), 0.2 pM P*, 0.05 iiM U oligonucleotide, 300 pM 
Na4PPi (the 20 MM stock solution was adjusted hy HCl to pH 8.0), 1 \iCi of [a-^^P]-dCTP 
(3000Ci/nmole, Amersham), 1 U of 7^7 DNA polymerase (Promega) and 2 ng of TU:UT. For 
5 Taq DNA polymerase, the reaction mixture was the same except for 50 mM KCl, 10 mM 
Tris/HCl (pH 7.4), 2.0 mM MgCfe and 1 U of Taq DNA polymerase (Boehringer Mannheim). 
The mixtures of PCR and other controls were the same except for the primers added. Cycling 
conditions included: 94°C for 15 seconds, 55°C for one minute, ramping to 72°C for one 
minute and 72°C for two minutes, for a total of 15 cycles. 



TABLE 1 
Oligonucleotides used in PAP 



Tem- 
plate 


G 

5" . . .AATCTGACTGACCCCTATTCCCTGCTT GGAAC-.S' (SEQ ID NO: 3) 
A 


Name 


Oligonucleotide sequence 5'-3' (SEQ ID NO:) 


Purpose 


Di 


ACTGACCCCTATTCCCTGCTT*' ( 4 ) 


Control 




ACTGACCCCTATTCCCTGCTTG"" (5) 


3' ddG and G allele specificity 
co-localized 


DiG' 


ACTGACCCCTATTCCCTGCTTGG* (6) 


G allele specificity 5' to ddG 


D3G' 


ACTGacCCCTATTCCCTGCTTGGG* (7) 


G allele specificity 5' to ddG 


D4G' 


ACTGACCCCTATTCCCTGCTTiSGGG* ( B ) 


3' ddG mkthatches template 


DjG' 


TCTGACTGACCCCTATTCCCTGCTTG* ( 9 ) 


DiG', with 5' extended bases 




TGACTGACCCCTATTCCCTGCTTA* (10) 


3' ddA and A allele-specificity 
co-localized 


U 


TCATACCGGAAAGGGCTGGAGATA (11), 


Upstream oligonucleotide 


Name 


3' tenninal 
nucleotide" 


AUele-specific 
nucleotide^ 


Size 
(base) 


T 

c 




Amphfication' 


Type 


Match 


Type 


From 3' 
terminus (bp) 




G allele 


A allele 


Di 


dT 


Yes 




+1 


21 


64 


Yes 


Yes . 


D,G' 


ddG 


Yes 


G 


0 


22 


68 


No 


No 


D2G' 


ddG 


Yes 


G 


-1 


23 


72 


No 


No 


DaG' 


ddG 


Yes 


G 


-2 


24 


76 


Yes 


No 


D4G* 


ddG 


No 


G 


-3 


25 


80 


No 


No 


DjA' 


ddG 


Yes 


G 


0 


26 


80 


Yes 


No 


DeA' 


ddA 


Yes 


A 


0 


24 


72 


No 


No 
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|U |dA 1 




1" 1 


72 |Yes |Yes ~] 


° DiG' was prpduQcdby add 


iog a G dideoxjTHJcleotic 


le to tiic 3 


' termiaus of tiiie Di, =a 



dideoxynucleotide at the 3' tEnninus. 

The T means -the 3' temmms is T deoxynucleotide and G* means (he 3' terminus is G 
dideoxynucleotide. The bold coital GandAaretheGandA bases corresponding to G and A 
alleles, rrapectively. The first base at the 5' terminus corresponds to nucleotide 208 in GenBank 
X55760. 

" The 3' terminal base is a deoxynucleotide or dideoxynucleotide, and creates a match (Yes) or a 
mismatch (No) with the corresponding base on the complementary strand of the template. 

^ The allele-specific nucleotide is G or A and its distance to the 3' terminus is assigned: 0 = at the 
3' terminus +1 = one base downstream firom the 3' termiuus, -1 = one base upstream, from the 
3' terminus, -2 = two bases upsfream from the 3' terminus, and -3 = three bases upstream from the 
3' terminus. 

' The Tn, for oligonucleotides was estimated to be 4°C X (G + C) + 2<'C X (T + A) at 1 M NaQ 
(Miyada and Wallace, 1987). 
^ The amplification with U and one P* or with only one P*. 

[0199] The reaction was electropboresed through a standard 2% agaiose gel. The gel was 
stained with ethidium bromide for UV photography by a CCD camera (Bio-Rad Gel Doc 1000), 
dried and subjected to Kodak X-OMAT™ AR fihii for autoradiography. 
Restriction digestion 

[0200J Each of the three restriction endonucleases of Aal (S'C'CGCS'/S'GGC^GS') Eael 
(S'PyTGGCCPuSV 3TuCCGG*Py5') and Eco0109I (5'PuGTGNCCPy3'/ B'PyCCNG^GPuS') has 
a restriction site within the PU:UP duplex. The G/G alleles were amphfied by PAP with D5G* 
and U; PGR amplification with Di and U was used as the control. 40 jil of the PAP reaction and 
2 ^1 of the PGR reaction were purified and concentrated wilii a Centricon® 100 
microconcentrator, and the products digested by the restriction endonucleaser 2.5 U of AdL in 
IX NE buffer 3; or 3 U of Eael in IX NE buffer 1; or 30 U of ^^coOlOOT in NE buffer 4 with 
BSA {all of the above enzymes and buffers from New England BioLabs). 10 ^ of the teaction 
was incubated at 37 °C for 2 hours. The digestion reaction was electrophoresed through a 
standard 2% agarose gel as described abQve. 
Principle of PAP 

[0201] Tfl and Tag DNA polymerases were shown to contain pyrophosphorolysis activity, lyi 
DNA polymerase was utilized to detect the G allele at nucleotide 229 of the Di dopamine 
receptor geaie (Liu et aL, 1995) (Fig. 6A). P* was synthesized with either ddG or ddA at the 
3'tecmnms (see Table 1). The S'temiinal dideoxynucleotide inhibits direct extension by 
polymerization, but can be removed by pyrophosphorolysis in the presence of pyrophosphate 
(PPi) -vrfien the P* is specifically hybridized with the complementary strand of the G allele. The 
d^raded oligonucleotide can be extended by polymerization in 5'-3'direction (Figs. 6B and 6C). 
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[0202] The enhanced specificity of PAP relative to PASA is provided by seriaUy coupling 
pyrophosphorolysis and polymerization. Signijacajit nonspecific amplification requires 
mismatcli pyrophosphorolysis and misincorporation by DNA polymerase, an exixemely rare 
event (Fig. 7). 

Specific ampHfication with D5G* and D3G* 
[0203] PAP was performed with two oligonucleotides (P* and U), Tfl DNA polymerase and 
DNA template of the G/G and A/A alleles. Multiple P* were tested (Table 1). D5G* (the allel&- 
specific nucleotide and didepxynucleotide are co-localized to the 3' terminus and D3G* (the 
aUele-specific nucleotide is two bases firom tlie 3' tenninus) specifically amplified tie G allele in 
the presence of PP,- (Fig. 8A). Without added PPj, no specific product was observed with D5G*, 
indicating lhat added PPj was an essential component for PAP (Fig. 8B, lanes 6 and 15). Faint 
products with D3G* in lane 4 and with D4G* in lane 5 were observed (Fig. 8B) (see below). 

Effects of pH, [PPi] and [dNIP] and enzyme 
[0204] Each of the above parameters was examined. PAP was most efficient at pH between 7.4 
and 7.7, at [PP,-] between 200 |iM and 400 nM, and at [dNTPs] between 25 pM and 50 
(Table 2). Taq DNA polymerase can substitute for Tfl with similar efficiencies (Table 2). 

TABLE 2 



Parameters affecting PAP 





Parameter 


PAP efficiency" 






D5G*-U 


D3G*-U 




8.1 






7.9 








7.7 


++ 


+++ 




7.5 


++ 


-H-+ 




7.4 


-H- 


-H-+ 




7.15 


+ 


■f 


PPi" 

(m 


1000 






800 




■ + 


600 




++ 




400 


++ 


+++ 




200 


++ 


+++ 




0 




± 


All dNTPs 
changed^ 


200 




+ 


100 




+ 
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50 


++ 


■H-f 


25 


++ 


++++ 


dGTP 
changed^" 


100 


± 


++ 


50 


+ 


++ 


25 


± 


++ 


dATP 


100 




+ 


50 




+ 


25 




++ 


polymerase 


G allele and PP; 


++ 


-W-l- 


A allele and PP,- 






G allele and no PPj 




± 


Tfi DNA polymerase was iKed to amplif 


y ttie G/G alleles under the conditions in 



Materials and Methods, except for the factors indicated 

The PAP efficiency is indicated as: no specific product(s); +, very weak specific 
product(s); +, weak specific product(s); ++, moderate specific product(s); +++, strong 
spccifio product(s); ++++, very strong specific product(s). 
" Hie indicated concentration was changed but the otfaears were kept at 200 ]M. 

Identity of specific prodncts 
[0205] In order to confirm tiie identity of Hie specific products, restriction esndonuclease 
digestion was perforcaed (Fig. 9). Each of fee feree restriction endonucleases of AcO, Eael and 
£'co0109 has a restriction site with the PU:UP duplex. The esxpected restriction fegments were 
found. Similar results were observed with D3G* and U. 

[0206] The specific products of PAP with D5G* and U revealed two specific bands on the 
agarose gel, le., PUrUP and UP; because U was mors ejfficient than D5G*, under our 
amplification conditions. In ordei- to confirm this, fee G/G alleles were amplified by PAP using 
Tfl DNA polymerase wife D5G* and U as previously. The products were denatured and 
electrorphoresed througji a denaturing polyacrylamide gel. Only one specific band in single- 
stranded form was obsearved, indicating that fee specific PAP products contain fee duplexed and 
single stranded segmeaats. Hie same result was observed wife D3G* and U. 
Linear PAP 

[0207] PAP was performed for liaear amplification wife only one P* fix>m fee G/G and A/A 
alleles in fee presence of PPj. The specific products of PAP were obtauied wife DsG* and wife 
D5G*, but not with fee ofeer P* (Fig. 10, lanes 4 and 6). The efficiency of P* was affected by 
fee oligonucleotide si2E, fee S'-teiminal dideoxynucleotide and fee position of fee allele-specific 
nucleotide. 
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[020S] Figs. 6A-6C show schematic of PAP. Fig. 6A. A duplexed DNA template TU:UT is 
amplified witii two oligonucleotides P* and U, Tfl DNA polymerase, dJ-lTPs, pyropliosphate and 
[a-^^P]-dCTP. P* = pyropliospliorolysis activatable oligonucleotide. In tiiis example P* is 
D5G* and TOUT is a 640-bp segment of the dopamine Di receptor gene. Fig. 6B. D5G* has a 
G dideoxynncleotide at the 3' terminus, and it is specific to the complementary strand of the G 
allele, but mismatches the A allele at the 3' termmus (Table 1). RemovaJ of the dideoxy G by 
pyrophosphorolysis is followed by polymerization for each amplification. Fig. 6C. 
Autoradiogrmn of PAP from the GIG, A/ A and G/A genotypes. When the G allele is present, 
the radioactively labeled specific products of 469 bases (diqjlex PU:UP and excess antiseose 
strand UP) are produced, since the low rate of pyrophosphorolysis by Ifi polymerase implies 
that oligonucleotide U has a much higher efficiency tiian oligonucleotide P*. Electrophoresis 
for a longer period separates PU:UP flrom UP. Other products of UT and UTiTU are indicated. 
Note that TU:UT derives from annealing of excess radioactively labeled UT with non- 
radioactively labeled TU orighial template. PAP was also performed wilh D3G* and U &om fbs 
GIG, A/A and G/A genotypes, and similar results were obtained. 

[0209] Figs, 7A-7B show enhanced specificity of PAP with D5G*, The specificity of PAP is 
compared with ftat of PASA to exponentially amplify a template pool of G and A alleles. Fig. 
7A, The specific miq)lification of PASA derives fixxm the high efficiency of p rime r extension 
when the ppmer matches the G allele. The nonspeeific amplification results fix)m mismatch 
extension firam the A allele. "When this occurs, it results in an efficiency substrate fat fiirther 
amplification. The ttiiclcness and position of the arrow represent the amplification efficiency in 
each cycle. Fig. 7B. The specific amplification of PAP from the G allele occurs at high 
efficiency. Two types of nonspecific amplifications originate from the A allele: (i) nonspecific 
anqjlification can occur at low efficiency by mismatch pyrophosphorolysis resulting in a A:T 
homo-duplex PU:UP product, which is not an efficient template for subsequent amplification; 
(ii) nonspecific amplification can occur at extremely low efficiency by both mismatch 
pyrophosphorolysis and misincorporation to produce a G:T lietero-duplex PU:UP product, but 
once it occurs, it provides an efficiency template for subsequent cimplification. A similar 
tendency of nonspecific amplifications is suggested for linear ampHfication by PAP with only 
D5G*. It should be noted that allele-specific nucleotide of P*, such as D3G*, may be near but 
not at the 3' terminus, hi that case nonspecific ampUfication of PAP requires both mismatch 
pyrophosphorolysis and mism^h extension. While both variations of PAP should have higher 
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specificity than. PASA, the highest specificity is predicted when the 3' temdnal dideoxy 
nucleotide is also the allele-specific nucleotide. 

[0210] Figs. 8A-8B show specific amplification with D5G* and D3G*. PAP was perfomed in 
the presence (Fig. 8A) or absence (Fig, SB) of added PPf wifli two oligonucleotides for 
exponeaitial ampMcation. The oligonucleotides are listed in Table 1. Extension coirtrols with 
only U identify the positions of 1U;UT and XJT. Extension controls with Di identify the 
position of PU. PGR controls of Di and U identify the positions of PU:UP and PU:UT. Only 
20% of the eixtension reaction with D] and the PGR reaction were loaded relative to other lanes. 
[0211] Fig. 9 shows resfaiction endonuclease digestion. To show specificity of PAP, samples 
from the experiment shown ia Fig. 8 were digested with AcH, £aeL and Eco0109l restriction 
endonncleases. Each enzyme has a restriction site within PU:UP. PAP amplified the G/G 
alleles with D5G* and U, and 5% of PGR reaction with Di and U were taken as control. AcS. 
produces a 236 bp and a 233 bp fiagments from PU:UP and a 407 bp and a 233 bp fragments 
from TU:UT. Eael produces a 289 hp and a 180 bp iiagments from PU:UP and a 460 bp and a 
180 bp fragments from TUiUT. 5c£>01091 produces a 348 bp and a 121 bp fragments from 
PU:UP and a 107 bp, a 412 bp and a 121 bp fragments from TU:UT. The arrows indicate the 
digestion products eixpected from FU:UP. 

[0212] Fig, 10 shows linear PAP. PAP was performed with only one P* in the presence of 
added PPj. 20% of the reaction with Di was loaded relative to other lanes (lanes 1 and 10). No 
= no oligonucleotide added. 

Enhanced specificity of PAP withDsG* 
[0213] Example 1 provides evidence that pyrophosphorolysis followed by polymerization may 
be used to increase the specificity of PASA. Significant nonspecific amplification requires the 
serial coupling of the two types of errors (Fig. 7). The mismatch pyrophosphorolysis rate to 
remove a mismatch deoxynucleotide at the 3' tenninus, expressed as the removal rate of an 
incorrect versus a correct dNMP, was reported at less than 10"^ for T7 DNA polymerase 
(Komberg and Baker, 1992; Wong et aL^ 1991). The misincoiporation rate to create a 
substitution rnutation by polymerization, expressed as the incorporation rate of ah incorrect 
versus a correct dNMP, was reported as to be 10"^ for T7 DNA polymerase and to be 10"* for 
E.coH DNA polymerase I (Komberg and Baker, 1992; Wong et al., 1991; Bebenek et al., 1990). 
Similar results were reported for Taq DNA polymerase and for 3-5' exbnucleaserdefident 
mutants of T7 DNA polymerase and E. coli DNA polymerase I (Komiberg and Baker, 1992; 
Wong et al., 1991; Eckert and Kunkel, 1990), The specificity dae to the (i) nonspecific 
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amplification in PAP with D5G* is estimated to be 10'^ per cycle, if the mismatcli 
pyrophosphorolysis rate of a dctNMP is the same as dNMP. The specificity due to the (ii) 
nonspecific amplification is estimated to be 3.3x10'", if the mismatch pyrophosphorolysis and 
the misincorporation are serially coupled. 

Essential components of PAP 
[0214] Each P* was tested by utilizmg Tfl or Taq DNA polymerases to amplify the G/G and 
A/A alleles.. The specific an^lification requires 1h,e prescaice of PP; and allele-specific template. 
In addition, the amplification efficiency is affected by the oligonucleotide size, the 3' terniinal 
dideojqmucleotide, the position of the allele-specific nucleotide relative to the 3' tenrnmis of P*. 
[0215] It is not clear why DiG* and D2G* did not geaierate the spedlic signals, but it may be 
related to a threshold stability of duplex between P* and the template. DeA*, which contains A 
dideoxynucleotide at the 3' terminus, did not generate the specific signal, which may be 
associated wifli different incorporation efficiencies of ddNTPs by polymesrization. Klenow 
firagment of E. coli DNA polymerase I, Taq DNA polymerase and /3^aq DNA polymerase 
incoiporate ddGTP more efficiently than oHifir ddNTPs (Sanga- et aL, 1977; Tabor and 
Richardson, 1995; Vander Horn et al., 1997). The rate of ddNTP incorporation also varies 
depending on the template sequence and can be 10-fold higher at some bases relative to oflieTS 
(Sanger et ai., 1977). Another possibility is that D5A* is shorter in size witti a lower T^. 
[021.6] In PAP without added PPj, very faint false signals were generated with D3G* and with 
D4G* (Fig. 8B). One possibility is that oligonucleotide dimers can form and trigger nonspecific 
pyrophosphorolysis of P* in later cycles after "cndo-" PPj is released from the by-polymcrization 
to generate UT. 3'terminal degraded D3G* and D4G* can be hybridized and extended as false 
signal. Oligonucleotide dimers were observed with D3G* and D4G*. Another possibility with 
D3G* is tiiat the specific pyrophosphorolysis can occur in later cycles after "endo-" PP; is 
released. A third possibility is that D3G* and D4G* were contcaninated by mimmal D3 and D4 
which were not fliUy added by G dideoxynucleotide at 3' termini. 

Comparison with other technologies 
[0217] A number of methods for enzymatic nucleic acid amplification in vitro have been 
developed and can be adapted to detect known sequence variants. These include polymerase 
chain reaction (PGR) (Saiki et al., 1985; SaiM et al., 1988), ligase chain reaction (IXR) 
(Landegren et al., 1988; Barany, 1991) and rollmg circle anqjlificatian (RCA) (Lizardi et al., 
1998; Baner et al., 1998). PAP is different in many ways: i) pyrophosphorolysis and 
polymaization are serially cot^led for each amplification, ii) there is at least one 
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dideoxyoligonucleotide for PAP. Other chemically modified nucleotides lacking the 3'- 
hydroxyl group at the 3' tenninus, such as acyclomcleotides can serve the same function (see 
Example 12 below), iii) one format is for hnear amplification and the other is for exponential 
amplification, iv) PP; is necessary for the amplification, v) significant nonspecific amplification 
requires both mismatch pyrophosphorolysis and misincorporation, vi) PAP caa detect known 
point mutations and greatly increase the specificity to detect an extremely rare mutant allele 
from the wild-type allele. 

[0218] The mechanistic basis is that two or more reactions are serially coupled for amplification 
with increased specificity. The key componeat of PAP is a pyipphosphoroiysis activatable 
oligonucleotide. The blocked 3" tearoinus in Ihrae experiments is a dideoxy nucleotide, but any 
non-extendible nucleotide susceptihle to pyrophosphorolysis could in principle be substituted 
Indeed, any enzyme that cleaves an oligonucleotide 5' to a mismatdi could serve the same 
function as pyrophosphorolysis activation. For esxample, a blocked ohgonucleotide including the 
methylated recognition sequence (such as G^ATC) is minealed to its target with the 
umnefhylated recognition sequence, then reshriction eadonuclease (such as Dpnl) can only 
cleave the me%lated site and. so activate the oligonucleotide foi extension. If a mismatch is 
located 5' to the cleavage site, significant nonspecific amplification requires the serial coupling 
of mismatch cleavage and a misincorporation, which is a rare event Activatable 
oligonucleotides may also be combined with "mimsequencing" primer extension. This may 
provide a more specific assay fijr detection of single base changes that might be particularly 
amenable to chip technology in which specificity can be a problem (Syvanen, 1999), 
Demonstration that PAP can occur in the linear format (Fig. 10) supports the feasibUity of this 
approach. 

[0219] Nucleoside triphosphates and 2'-deoxynucleoside triphosphates or their chemically 
modified versions may he used as substrates for muMpIe-nucIeotide extension by PAP, i.e., 
when one nucleotide is incorporated the extending strand can be further extended. 2',3'- 
dideoxymicleoside triphosphates or their chemically modified vereions that are terminators for 
further extension may be used for single-nucleotide extension, 2',3'-dideoxynucleoside 
triphosphates may be labeled with radioactivity or fluorescence dye for differentiaiiorn fix)m the 
3' temiinal dideoxynucleotide of oligonucleotide P*. Mixtures of nucleoside triphosphates or 2'- 
deoxyuucleotide triphosphates and 2',3'-dideoxynucleoside triphoq)hates may also be used. 
[0220] Jn PAP, specific nucleic acid sequence is produced by using the nucleic acid containing 
that sequence as a template. If the nucleic acid contains two strands, it is necessary to separate 
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the strands of the nucleic acid before it can be used as the template, either as a separate step or 
simultaneously. The strand separation can also be accomplished by any other suitable method 
including physical, chemical or en2ymatic means. 

[0221] When it is desired to produce more than one specific product from the original nucleic 
5 acid or mixture of nucleic acids, the appropriate number of diffeient oligonucleotides are 
utilized. For example, if two different specific products are to be produced exponentially, four 
oligonucleotides are utilized. Two of flie oligonejcleotides ( P*S:1) are specific for oiie of the 
specific nucleic acid sequences and the oflier two oligonucleotides ( P*^l) are specific for the 
second specific nucleic acid sequence. In this manner, each of the two different specific 

10 sequences can be produced ejsponentially by the present process. 

[0222] The DNA or RNA may be single- or double-stranded, may be a relatively pure species or 
a component of a nuxture of nucleic acids, and may be linear or circiilar. The nucleic acid or 
acids may be obtained &om any source, for example, fixim plasmid, from cloned DNA or RNA, 
or from natural DNA or RNA firm any source, including bacteria, yeast, viruses, and higher 

IS organisms such as plants or animals. DNA or RNA may be extracted from bloo4 tissue material 
such as chorionic villi or amniotic cells by a variety of techniques such as that described by 
Maniatis et al. (1982). 

[0223] The P* oligonucleotides are selected to be "substantially" complementary" to the 
different strands of each specific sequence to be amplified. Therefore, the P* oligonucleotide 

20 sequence need not reflect the exact sequence of the template. For example, a 
non-complementaiy nucleotide segment may be attached to the 5'-end of the P* oligonucleotide, 
with the remainder of the P* oligonucleotide sequence being complementary to the strand. 
Alternatively, non-complementaiy bases or longer sequences can be interspersed into the P* 
oligonucleotide, provided that the P* oligonucleotide sequence has sufficient complementarity 

25 with the sequence of the strand to be .miphfied to hybridize therewith and form a template for 
synthesis of the extension product of the oilier P* oligonucleotide. As used ia the claims, the 
term "complementary" should be understood to mean "substantially complementary," as 
discussed herein. 

[0224] Any specific nucleic acid sequence can be produced by the present process. It is only 
3 0 necessary that a sufficient number of bases at both ends of the sequence be known in sufficient 
detail so that two oligonucleotides can hybridize to different strands of the desired sequence at 
relative positions along the sequence. The greater the knowledge about the bases at both ends of 
the sequence. Hie greater can be the specificity of the oligonucleotides for the target nucleic acid 
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sequence, and tlius the greater the efBciency of the process. It wiU be understood that the word 
oligonucleotide as u^ed hearemafter may refer to more than one oligonucleotide, particularly in 
the case where there is some ambiguity in the information regarding the tenmnal sequence(s) of 
the segment to be amplified. 

[022S] The present invention can be performed in a step-wise fashion where after each step new 
reagmts are added, or simultaaeousiy, where all reagents are added at the initial step, or partially 
step-wise and partially simultaneous, where fiesh reagent is added after a given nmnbca: of steps. 
The simuhaneous mettod may be utilized when an enzymatic means is used for the strand 
separation step. In the smiultaneous procedure, the reaction mixture may contain the 
strand-separating enzyme (e.g., helicase), an appropriate energy source for the strand-separating 
enzyme, such as ATP. Additional materials may be added as necessary. 
[0226J The nucleic add polymerase may be any compound or system that will fimction to 
accomplish the amplification. Suitable enzymes for this purpose include, for example, Tfl DNA 
polymerase, Taq DNA polymerase, E. coU DNA polymerase I, Klenow fiiagment ofE. coli DNA 
polymerase I, T4 DNA polymerase, T7 DNA polyma^e, other avaUsible DNA polymerases, 
RNA polymerases or their variants, reverse transcriptase or its variants, and otlier genetic 
engineered versions. It is predicted on the basis of the relationship between reverse and forward 
reactions that a DNA polymerase will have high and even pyrophosphorolysis activity for the P* 
activatable oligonucleotide, if it incorporates ddNTPs eflSciently (compared with dNTPs) and 
evenly (compared among the four ddNTPs), Of all the DNA polymerases, the genetic 
engineered version may be the best in the futare, such as TheimoSequenase (Vaader Horn et al., 
1997). Generally, the synthesis will be initiated at the 3' end of each oUgonuclcotide and proceed 
m the 5' direction on the template strand. However, inducing agents which initiate synthesis at 
the 5' end and proceed in the other direction can also be used in the PAP method as descrfljed 
above. 

EXAMPLE 2 
Preparation of template by PGR 
[0227] A 640-bp region of the human Di dopamine receptor gene was amplified by PGR with 
two primers (T = 5' GAG CTG GAG CAA GGG AGT GAG AAG 3' (SEQ ID N0:1) and U = 5' 
TCA TAG GGG AAA GGG GTG GAG ATA 3' (SEQ ID N0:2)). The TU:UT duplexed product 
spans nucleotides 33 to 672 in GenBank X55760 and the G+C content of the product is 55% . A 
common A to G polymorphism is located at nucleotide 229, resulting in three genotypes of G/G, 
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A/A and G/A. The PGR volume is 50 ixl: 50 mM KCI, 10 mM Tris/HCl, pH 8.3, 1.5 luM MgCk, 
200 each of the four dKTPs, 0.1 jxM of each primer, 2% DMSO, 1 U of Tag DNA 
polymerase (Boehringer Maimieim) and 250 ng of genomic DNA from G/G homozygote, A/A 
homozygote or G/A heterozygotes. Cycling conditions included: denaturation at 94°C for 15 
sec., annealing at 55°C for 30 sec, and elongation at 72°C for one min., for a total of 35 cycles 
with a GeneAmp PGR System 9600 (Perkin-Ehner Applied Biosystems). The PGR product was 
purified from primers and other small molecules by approxunately 10,000-fold by three tunes of 
retention on a Centricons 100 microconcentrator (Amicon). The amount of recoverad PGR 
product was determined by UV absorbance at 260 nm. 

Synthesis of P* by adding a 3' dideoxytmcleotide 
[0228] The deoxynucleotide oligonucleotide was synthesized by Perseptive Biosystems 8909 
Synthesizer (Framinsham) and purified by oligopure cartridges (Hamilton) m the City of Hope 
DNA/RNA Ghermstty Laboratory. The 3' terminal dideoxynucleotide was added by teinmnal 
transferase. The mixture contained a total volume of 30 iil: 100 mM potassium cacodylate (pH 
7.2), 2.0 mM C0CI2, 02 mM DTT, 2500 pM of the oUgonucleotide, 2 mM T, S'-ddNTP (the 
molar ratio of the 3'-0H terminus to ddNTP was l:24)(Boehringer Mamiheim), ICQ U of 
terminal transferase (GIBCO BRL). The reaction was incubated at 37°C for 4 hr and then 
stopped by adding EDTA at 5 mM final concentration. After desalting using a Centii-spin™ 
column (Princeton Sq)arations), P* was purified by preparative 7 M urea/20% polyacrylamide 
gel electrophoresis in TBB buffer (90 mM Tris^orate, 1 mM EDTA, pH 8.3) (Maniatis et al., 
1982). The amount of the recovered P* was determined by UV absoibance at 260 nm. 
[0229] Since small amounte of unieatminated oligonucleotide would result in nonspecificity of 
pyrophosphorolysis, eacji P* w^ ^^^-labeled at the 5' terminus by T4 polynucleotide kinase and 
then was electrophoresed through a 7 M wea/20% polyacrylamide gel. Only P* products were 
visible even when the gel was overexposed. It is estimated that more than 99.99% of P* 
contained a dideoxynucleotide at the 3' tenninus. The purity of P* was supported by the absence 
of PGR product or PAP product at pH 8.3. 

Pyrophosphorolysis activated polymerization 
[0230] Regions from 445 to 469 bp within the TU:UT duplexed template were amplified by 
PAP with oligonucleotides P* and U, or with only P*. The PU:UP duplexed product 
corresponds to nucleotides 204-228 to 672 in GenBank X55760 and its G+C content is 56%. 
The PAP reaction mixture contained a total volume of 25 ^il: 50 mM KGl, 10 mM Tris/HGl (pH 
7.6), 1.5 mM MgGla, 100 fxM each of the four dNTPs (dATP, dTTP, dGTP and dCTP), 0.1 fiM 
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P*, 0.1 tiM U oligonucleotide (TCATACCGGAAAGGGCTGGAGATA (SEQ ID NO:2)), 300 
iiM Na4PPi, 2% DMSO, 1 ^Ci of [a-^¥] dCTP (SOOOCi/mmole, Amersham), 1 U of 
AmpWaqFS DNA polymerase (PE Applied Biosystetns) or 0.5 U of each of Amplira^FS and 
Tag DNA polymerases, and 10 ng of TU:UT. TlieiraoSequeiiase (Amersham Pharmacia) was 
also tested under the same cx)nditions except for 8U ThermoSequ^ase or 4U TheimoSequenase 
plus 0.5U Tag and 2.5mM MgCh. The cycling conditions included: denaturalion at 94''C for 10 
sec, amealing at 60X for 1 min. (at 55 °C for ThermoSequenase), and ebngation at 72''C for 2 
min., for a total of 15 cycles. 

[0231] The product was electrophoresed thror^h a standard 2% agarose gel. The gel was stained 
wiUi efthidium bromide for UV photography by a CCD camera (Bio-Rad Gel Doc 1000) and 
Multi-Analyst® software, dried and subjected to Kodak X-OMAT™ AR film for 
autoradiography. The PAP yield was quantitated with a Phosphorlmager with ImageQuant 
software (Molecular Dynamics) as the total number of pixels in the PGR band iniiras the 
background, indicated as a random unit. 

Enhanced PAP efficiency 
[0232} In Example 1, only tbe P* with ddG.at the 3' t^nninus was amplified using native Tfl or 
Taq DNA polymerase. Aniplira^FS and ThennDSequeoase DNA polymerases were found to 
achieve much higher PAP efBdeacy with much less discrimination against any kind of 
dideoxynucleotide (ddAMP, ddTMP, ddGMP or ddCMP) at the 3' terminus of P*. For example, 
P*(212)18G'' and P*(212)18A", which are 18-mers of the dopamme Di receptor gene but have 
ddGMP and ddAMP at the 3' teimim (Table 3), specifically ampHfied the G and A alleles, 
respectively. Their yield ratio was 1.4 (compare lanes 9 with 11 in Fig. UB), and so 
P*(212)18G° is estimated to be 4% more efficient per cycle than P*(212)18A°. Another 
F*(228)26A-'* = 5' TAGGAACTTGGGGGGTGTCAGAGCCG* 3' (SEQ ID N0:12), which is 
a 26-mer with ddCMP at the 3' terminus, was amplified as efficiently as a primer without 
ddCMP at the 3' terminus, and the yield was estimated to be increased 1,000 fold conq)ared with 
that by using Tfl or Taq. Moreover, PAP amplified segmente directly fixim human genomic 
DNA. 
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[0233] AmpMTaqFS has two mutatians compared witii native Tag, One imitation in the 5' 
nuclease domam eliminates 5'-3' exonuclease activity and Uie second mutation F667Y in the 
active site (Imiis and Gelfend, 1999). ThetmoSequenase has the same mutation F667Y in the 
active site but a deletion of the 5'-3' axonuclease domain (Tabor and Richardson, 1995; Van der 
Horn et al., 1997). They do not distinguish between dNTP and ddNTP for incorporation. The 
pyrophosphorolysis of ddNMPs, which is the reverse reaction, is supposed to be much higher 
and less discriminated by these enzymes. Although either AmpIir<2^FS or ThernioSequenase 
DNA polymerases used was formulated to contain a thermostable pyrophosphatase 
(manufacturers' instructions) that can hydrolyze PPj in the reaction so as to decrease PAP 
efficiency, PAP was still amplijaed under our conditions. AmpliJa^FS and TheimoSequenase 
DNA polymerases will work better in their pure form without the contaminated 
pyrophosphatase. 

The 3' specific subsequence of P* 
[0234] Various P*s were examined with different lengths and mismatches using ArapUTaqFS 
(Table 3). He effect of length and mismatch on PAP efiSciency is expressed as the relative yield 
(%) between two P*s of different lengths from the same template (Fig. 12), which varied fipm 
0.0% to 201.5% with each two to four less bases iti length. The specificity of PAP is also 
affected by P* length and mismatch (Table 3). The noise rate (%) is defined as the relative yield 
of -flie mismatch product to the match product, and a specific signal is scored with <10% noise 
rate. If the aUele-specific base of P* was at the 3' teaminus, only the specific allele was 
amplified and the specificity was not associated with P* length (Fig. 12A). If the allele-specific 
base was not at the 3' terminus of P*, flie specificity was associated with P* length. Any 
noii-3'-t«ininal mismatch in ttie 18-mer P*, which was up to 15 bases &om the 3' terminus, 
caused no amplification (Figs, t2B-12E), but even two such mismatches in the 26-mer P* 
caused non-specific amplification. 

[0235] The IS-mers were ftjrthcT examined using "stacked" P*s, which span the allele-specific 
base at different positions (Fig. 13 and Table 4). Hie noise rate (%) varied from 0.0% to 7.1%. 
The length of the 3' specific subsequence was ^ 13 bases. 
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TABLE 4 

PAP specificity with dififerently positioned P*s 



Name 


Sequence (SEQ ED NO :) 


Teanplate 


6 

5 ' GACTGACCCCTATTCCCTGCTT-GGAACTrGAGGGGTGTC . . . 3' (33) 
A 


P*(212) 180" 


5'ACCCCTATTCCCTGCTTG* (16) 


P*C212)18A" 


5'ACCCCTATTCCCTGCTTA* (34) 


P*(214)18A"' 


5'CCCTATTCCCTGCTTAGG* (24) 


P*(218) 18G-° 


5'TTCCCTGC::TGGGAACT* (35) 


P*(221) 18G-' 


5'CCCTGCTTGGGAACTTGA* (36) 


P*(224) 18G-'" 


5 ' TGCTTGGGAACTTGAGGG* (37) 


Name 


3' terminal 
dideoxy 


Allele-specific 
base 


Tm eC) 


Noise rate (%)" 


Type 


Dist 
ance 


Exponential 
PAP 


Linear PAP template 


P*(212)18G'' 


ddG 


G 


0 


56 


2.7 


0.0 


P*(212)18A" 


ddA 


A 


0 


54 


3.8 


1.1 


P*(214)18A"' 


ddG 


A 


'2 


56 


4.7 


0.0 


P*(218)lSff'' 


ddT 


G 


-6 


54 


0.0 


0.0 


P*(221)18G-' 


ddA 


G 


-9 


56 


1.7 


1.7 


P*(224)18G-" 


ddG 


G 


-12 


56 


7.1 


0.6 



with one P*. The noise rate of PAP (%) is the relative yield of the non-specific allele product to 
liie specific allele product. 

[0236] Similar results were obtained by using P*,s vvMoh match and mismatcli the G allele at 
different positions (Table 5). The noise rate with one mismatch was various firom 0.8% to 5.6%. 
The lenglii of the 3' specific subsequence was > 16 bases. The noise rate with two mismatches 
was 0% (compare lane 2 with lanes 1 0- 1 5 in Fig. 14). 
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[0237] Linear PAP was examined using only 18 mer P*s and higher specificity was observed 
with lower noise rate (Tables 4 and 5). Linear PAP takes a different mechanistic pathway in 
which, every non-specific product is generated firom die starting template which require 
mismatched pyrophosphorolysis with the 3' terminal mismatchjed P*, or both mismatched 
s pyrophosphorolysis and miamatched extension with the non-3' terminal mismatched P* . 

[0238] PASA was performed with 17-mer primers without adding a ddNMP at the 3' temiinus 
(see Tables 4 and 5). A mismatched 17-mer primer strongly amplified a nonspecific product 
with 30% noise rate when the mismatch was as near as 6 bases to 3' terminus, showing a much 
shorter 3' specific subsequence. Similar results were reported elsewhere previously (Sarkar et al., 
10 1990). 

[0239] In "summary, P* (1-length) has two subsequences: a 3' specific subsequence (n = the 
number of bases of the 3' specific subsequence <1) determiaes the specificity, i.e.,wifhin this 
region any mismatch to its complementary strand of the template results in no substantial 
amplification; and a 5' enhancer subsequence (m = the number of bases of 5' enhancer 

15 subsequence > 0) enhances the amplification efficiency. PAP specificity is co-determined by the 
base pairing specificity of the 3' specific subsequence, the pyrophosphorolysis specificity and 
the polymerization specificity. Thus, the base pairing specificity of the 3' spiectfic subsequence is 
a mimmum requirement of the PAP specificity, 

[0240] The length of the 3' specific subsequence of P* may be affected by the sequence context 
20 and size of tlie P*, the type of the 3' terminal dideoxynucleotide, the template sequence, the 
DNA polymerase, other components like ion, and cycling conditions, Whea the template 
contains repeated sequences > 1 or homogeneous polymer runs > 1, P* loses specificity for 
anchoring. The length of the 3' specific subsequence of P* may be affected by the sequence 
context and size of the P*, the type of the 3' terminal dideoxynucleotide, the template sequence, 
25 the DNA polymerase, other components hke ion, and cycling conditions. When the template 
contains repeated sequences > 1 or homogeneous polymer runs > 1.. P* loses specificity for 
anchoring. 

Scanning or Resequencing for Unknown Sequence Variante 
[0241] The property of the 3' specific subsequence of P* can be applied to scanning for 
3 0 unknown sequence variamts or re-sequenciog of predetermined sequences in a parallel way. Each 
nucleotide on the complementary strand of the predetennined sequence is queried by fbui: 
downstream P*s, such as 18-mers (Pig. 11), which have identical sequence except that at the 3' 
terminus, either ddAMP, ddTMP, ddGMP or ddCMP corresponds to the wild-type sequence and 
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the three possible single base substitutions. The number of P*s scarming the complementary 
strand of X bc^es is multiplication of 4 aad X, which is suitable for either exponential or linear 
PAP. The four downstream P*s can even be unmobilized on a single dot when ddAMP, ddTMP, 
ddGMP and ddCMP at the 3' teimini are labeled differently for differs ;ntiation, such as by four 
5 fluorescaice dyes. The amplification signal can thus be represented by intensity decrease of 
each dye when ddNMP is removed from P* by pyrophosphorolysis. One advantage of linear 
PAP is that the four ddNIPs can be used as substrates for single base extensions, with are 
labeled with difereut dyes for differentiation. 

[0242] Briefly, if only all the P*s corresponding the wild-type sequence are specifically 
10 amplified, the wild-type seqpience can be aiianged in order by analyzing overlaps. A P* wMi a 
single base substitution at the 3' tenninus is amplified at the position of hemi- or homo-point 
mutations. The mutation also creates a "gap" of no PAP signal, wliich spans a region of several 
successive nucleotides. For single base substitation, the gap size (bases) + 1 = the length of the 
3' specific subsequence, 

15 [0243] Furthermore, we can also scan the sense strand by deigning a second set of upstream 
P*s. An unknown single base substitution can be deteannined by oombination of the two sets of 
P*s, even in heterozygotes. An unknown small deletion and insertion can be detected and 
localized. In order to identify a specific type of deletion or inseirtion, it is possible to add 
corresponding P*s. For fingerprinting, which can provide information of mutation position, 

20 tiiere is a simple stacking way that the stacked region of each, two successive P*b < the 3' 
specific subsequQice on the array to reduce the nranber of P*s by up to n fold. 

Determination of de novo DNA sequence 
[0244] The concept of denovo DNA sequencing by PAP makes use of aE the possible 3' 
specific subsequences of P* to identify the presence of the 3' specific subsequence in de novo 

25 sequence. A complete set of the 3' specific subsequences of P* is 4". Each of the 3' specific 
subsequence has a complete subset of the 5' enhancer subsequence of 4^". For example, a 
complete set of 16-mer as the 3' specific subsequence and 2-mej- as the 5' enhancer subsequence 
can be mdicated as (A T, G, C)(A, T, G, C) N,6 = 4^1 

[0245] Briefly, the procedure first determines the list of all the specific PAP amplifications and 
30 then reconstructs the unknown DNA complementary seqneace from this list by ordering the 3' 
specific subsequences with the given length by using the Watson-Crick pairing rules. 
[0246] The assembly process is interrupted wherever a given 3" specific subsequence of P* is 
encountered two or more times. One of the fectors influencing the maximum sequencing length 



wo 03/095664 



PCT/US03/14556 



52 

is the length, of the 3' specific subsequence. The length of a raadom sequence that can be 
reconstructed unambiguously by a complete set of the 3' specific subsequence with the given 
lengtti is approximately the square root of the number of the 3' specific sequence in the complete 
set with >50% possibility that any given 3' specific subsequence is not encountered two or more 
5 times. Octamers of the 3' specific subsequence, of which there are 65,536, may be useful in iJie 
range up to 200 bases. Decaimcleotid.es, of which there are more than a million, may ansiyzs up 
to a Idlobase de novo sequence. 18 mer P*s containiog 16 mea: as the 3' specific subsequence, 
which complete set is 4*® of P*s, may sequesoce maxhnum 77,332 bases, 
[0247] When there is neigiibored known sequmce to design an opposite ohgonucleotide for 

10 PAP with two oligonucleotides. The maximum sequencing lengfli is mainly limited to the 
opposite oligomiclebtide, but not to the length of tiie 3' specific subsequence of P*, tmned 
Conditional ffe novo DNA sequencing. 

Other Applications for PAP 
[0248] For fingeipiintting which compares two DNA sequences to see if they are the same or 

15 different, (here is a simple way to reduce the number of P*s by using ah incomplete set of the 3' 
specific subsequences. By arranging them in a particular order, it is possible to identify the 
chromosomal locations as well as sequences. Considering the 3 x 10^ bp DNA in human 
genome, PAP with two ohgonucleotides is prefecred over PAP with only one P* to inctEase the 
specificity. 

20 [0249] To monitor gene expression profiling, where, up to 6 x 10'^ to 10^ transcripts are 
expressed and details of the precise sequence are unnecessary, PAP 'vvith only one P* can be 
applied and a set of P* which identify unique motifs in genes can be designed with a total length 
of up to 22- mer. Between eacb two P*s, there is at least a sequence difference at the 3' terminus 
or > 2 sequence differences at the non-3' terminus. 

2 5 Comparison with Sequence by Hybndi2a.tion 

[0250] In SBH by using oligonucleotide, the DNA sequence is determined by ttie hybridization 
and assembly of positively hybridizing probes thorough overlapping portions. It has been taiown 
for a long time that a single oligonucleotide hybridization on a iounobilized sample can be very 
specific in optimal Ifyfaridization and washing conditions (Wallace et al., 1979), thus it is 

30 possible to discriminate perfect hybrids ftom ones containing a single internal raismaiclL The 
oligonucleotides in array are 11-20 nucleotides in length and have 7-9 bases specific region in 
the middle, the non-specific signal is generated by mismatched hybridization. Under standard 
hybridization and washing conditions, the duplex stabilrfy between match and mismatch is also 
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affected by tlie tenninal mismatch and the flanking sequence (Drmanac et al., 1989; Khrapko et 
al., 1989; Ginot, 1997). 

[0251] SHB can be modified with enzymes in several ways (Miyada and Wallace, 1987; 
Southern, 1996). Primer extension by DNA polymerase incorporates bases one at a time only if 
they match the complement strand. Ligase has similar requirements: two oligonucleotide can be 
joined en2ymatically provided Ihey both are coniplementary to the template at the position of 
joining. 

[0252] Figs. ilA-llB show the enhancement of PAP efficiency. Fig. IIA. PAP is amplified 
with two oHgonueleotides P* and U from diqjlex TU:XJT template. Each of the four P*s has a 
ddA, ddT, ddG and ddC at the 3' temmms. The 3' terminal base is either specific to the 
complementary strand of the G or A alleles, or not matched. Fig. IIB. Autoradiograna of PAP 
from the G/G, A/A and G/A genotypes of the human dopamine receptor gene. The radioactively 
labeled specific products of 461 bases (duplex PU:UP and excess antisense strand UP) are 
produced. Other side products UT and UT:TU are indicated. Note that TUOT derives from 
annealing of excess radioactively labeled UT with non-radidactively labeled TU origiiial 
template. 

[0253] Figs. 12A42E show the effect of P* length and atismatch on PAP efficiency. PAP was 
amplified with P* and U oligonucleotide (see Table 3). In each of Figs. 12A-12E, P*s have the 
sample 3' termini but are different in length. Fig. 12A. In lanes 1-4, the P*s matched and 
amplified the G allele. In lanes 5-8, the P*s mismatched at the 3' teamini but amplified the A 
allele. Fig. 12B. In lanes 9-12, the P*s matched and amplified the G allele. In lanes 13-16, the 
P*s mismatched at -12 bases to the 3' termini but amplified the A allele. Fig. 12C. la lanes 
17-20, the P*s matched and amplified the A allele. In lanes 21-24, the P*s mismatched at -2 
bases to the 3' termini but anqtlified the G allele. Fig. 12D. In lanes 25-28, the P*s mismatched 
at -9 bases to the 3' termiai but amplified the A allele. Fig. 12E. Li lanes 29-32, the P*s 
mismatched at -15 bases to the 3' termini but amplified the A allele. The length effect is 
indicated as the yield ratio in one lane (Ln) to the previous lane (L^-i). The length effect was not 
shown in lanes 5-8 because the signals are at or close to the baekgromid. 
[0254] Fig. 13 shows PAP specificity with differently positioned P*s. PAP w^ ampliSed with 
a P* and U oligonucleotide (see Table 4). The P* matched to and amplified the G allele in lanes 
2-7, but mismatched to and amplified die A allele in lanes 9-15, Lanes 1 and 9 wrae PGR control 
wifhD!(212)17merandU. LanesS and 16 were exteaosion control wife only U. 
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[0255] Fig. 14 shows PAP specificity witii differently mismatched P*s. PAP was amplified 
with a P* aiid U oHgonucleotide (see Table 5). In lanes 2-7, the P* amplified the G allele with 
match or one mismatch. Iq lanes 9-15, the P* amplified the A wilh one or two mismatches. 
Lanes 1 and 9 were PGR control with D,(212)17 mer and U. Lanes 8 and 16 w^e extension 
control with only U. 



EXAMPLES 
PAP Aiiq)lification From Genomic DNA 
[0256] This example, illustrates PAP amplification directly from genomic DNA, The 
oligonucleotides used hi this example are listed below. Lane numbers refer to lanes in Fig. 15. 
(02571 Tlie downstream oligonucleotides in 0.1 fiM concentration are; 
Lane 1: Di(204)25D 5' TCTGACTGACCCCTATTCCCTGCTT 3' (SEQ ID NO:43) 

Lane 2: P*(206)24A'' 5' TGACTGACCCCTATTCCCTGCTTA* 3' (A allele specific; 

SEQIDNO:44) 

Lane3:P*(204)26G'' 5' TCTGACTGACCCCTATTCCCTGCTTG* 3' (G allele 

specific; SEQ ID NO:45) 
Lane 4: P*(206)24G-=' 5' ACTGACCCCTATTCCCTGCTTGGG* 3' (G allele specific; 

SEQIDNO:46) 

Lane5:P*(228)26A-^'* 5' TAGGAACTTGGGGGGTGTCAGAGCCC* 3' (A allele 

specific; SEQ ID NO:47) 
[0258J The opposite upslieiim oHgonucleotide in O.l pM ccncenlTation is: Di(420)24U 
5' ACGGCAGCACAGACCAGCGTGTTC 3' (SEQ ID NO:48), which was pahed with each 
downstream oligonucleotide. See Footnotes of Table 3 for details. 

[0259] The other components were the same as in Example 2, except for the followiag: 0.5 U of 
each of Amphla^FS and Tag DNA polymerases, and 100 ng of heterozygous G/A allelic 
genomic DNA were used per 25 |il reaction by using 30 cycles. 

[0260] The PAP product size range from 193bp to 218 bp. One double stranded and one smgle 
stranded product was observed on the gel, indicating the exhaust of PPf hydrolyzed by the 
contaminated thermostable pyrophosphatase. 
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EXAMPLE 4 
Comparison of Specificity of LM-PCR and LM-PAP 
[0261] The LM-PCR protocol includes primer extension, linker ligation, PCR amplification, and 
directed labeling iii the human dopamine D| receptor gene model system (Fig. 16). LM-PCR 
was perfonned with the addition by terminal deoxynucleotidyl transferase (TdT) (Ms protocol is 
known as TD-PCR) on UV-treated genomic DNA samples essentially as described (Pfeifer et 
al., 1999), except that VeniR (exo-) DNA polymerase was used in the first 10 cycles of primer 
extension (PI primer = 5' TTGCCACTCAAGCGGTCCTCTCAT 3' (SEQ BD NO:49)). 
TeD^rature cycles were 1 min at 95° C, 3 min. at 63° C, and 3 min at 72" C. To oihaQce the 
signal, terminal trmsferase was added to the protocol, and this variation of LM-PCR is called 
TD-PCR. Dynabeads were used to enrich target DNA molecules before terminal 
deoxynucleotidyl transferase (TdT) tailing. PCR was perfbimed using Ejcpand Long Template 
PGR System 3 (BMB) as described by the manufacturer CP2 primer = 5' GAAGCAATCTGGCT 
GTGCAAAGTC 3* (SEQ ID NaSO)). The PGR products were purified using QIAquick PCR 
Purification Kit (QIAGEN) before perfonning the direct labeling. A portion of the cleaned PGR 
product was used for direct labeling with AmpUTdq DNA Ppjymerase (Peikin-Elmer) witli ^^P 
labeled primers: 

P3A: (5' TCTGACTGACCCCTATTCCCTGCTTA 3' (SEQ ID N0:51; the 3' terminal 

deoxynucleotide is A allele specific) and 
P3G: (5' TCTGACTGACCCCTATTCCCTGCrre 3' (SEQ ID NO:52; the 3' terminal 

deoxynucleotide is G allele specific). 
[0262] LM-PAP was perfonned as allele-specific PCR except for the direct labeling stqj by 
PAP (Fig. 16A). The purified PCR product was used for direct labeling with ^¥ labeled primers: 
P3A: 5' TCTGACTGACCCCTATTCCCTGCTTA* 3" (SEQ ID NO:53; the 3' terminal 

deoxynucleotide is A allele specific) and 
P3G: 5' TCTGACTGACCCCTATTCCCTGCTTG* 3' (SEQ ID NO:54; the 3' terminal 

deoxynucleotide is G allele specific) 
using PAP reaction conditions in a 10 [i\ volume (50 mM KCl, 10 mM Tris/HCl (pH 7.6),1.5 
mM MgCla. 100 of each dNTP, 0.1 \M P*, 300 loM Na4PPi, 2% DMSO, 0.25U each of 
AmplxTa^FS and Amp]iTag DNA Polymerases (PefMn-Ehner). The cycling conditions were 
94° C, 10 sec.; 60° C, 1 min. and 72° C, 2 min. for a total of 8 or 16 cycles. LM-PAP was 
dramatically more specific than LM-PCR The initial data with the dopamine Dl geae shows a 
lower background with LM-PAP than with the idenfical unblocked oligonucleotide witii LM- 
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PGR. Also, LM-PAP can be performed with the PGK gene, a gene with, a very high GC rich 
region(70%)(Fig. 16B). 

[0263] Fig. 16A shows a UV footprintiag of the dopamine Dl receptor gene with a comparison 
of aliele-specific LM-^PAP and aUele-specific LM-FCR. A direct comparison of LM-PAP with a 
P* and LM-PCR with an unblocked primer of identical sequence shows that two alleles can be 
distinguished with LM-PAP, but not with LM-PCR. Both methods were performed on HF-16 
DNA tiiat was untreated (C), in vitro treated (T) or in vivo treated (V) with UV. The direct 
labelhig reaction using PAP conditions (lanes 7-18) with ^^P labeled primers PSA* (lanes 7-9 
and 13-15) and P3G* (lanes 10-12 and 16-18) was done mihAmpliTagFS saAAmpUTaq for 8 
and 16 cycles. For LM-PCR tiie direct labeling reaction was done with AmpliTaq (lanes 1-6) 
and ^^P-labeled primers PSA (lanes 1-3) and P3G (laiies 4-6) for 8 cycles. Allelic primers P*s, 
PSA* and P3G* for LM-PAP clearly distinguish the two alleles, while unblocked allelic primers 
of identical sequence, P3A and P3G, were unable to distinguish the alleles by LM-PCR. 
10264) Fig. 16B shows a UV footprinting of the pgK gene. The LM-PAP procedure for POK 
was essentially the same as for the dopamine Dl receptor except that Pfii Turbo DNA 
polymerase was used in the primer extension, as well as 7-deaza-dGTP/dGTP in a 3;1 ra.tio. 
Tanperature cycles were 95° 1 min., 60° 2 mia, and 76° 3 rain. The PGR step was performed 
using Vent (exo-) DNA Polymerase at 97° 1 min., 60° 2 min., 76° 3 man. also with deaza dGTP. 
The purified PGR products were used for direct labeling with the ^^P P3G* and P3C* primers 
iKing PAP reaction conditions in a 25 pi volume (50 mM KCL, 20 mM Hepes, pH 6.95, 10 mM 
(NH4)2S04, 1.5 mM MgCfe, 40 pM dNTP, 150 yM Na4PPi, 4% DMSO, and 1 unit of AmpUTaq 
FS DNA Polymerase. The conditions for cycling were 94° 15 sec, 60° 30 sec., and 72° 1 ipin. 
for 10 cycles. 

EXAMPLE 5 

Optunization of PAP-A to Detect a Mutation in 1 of IC^-IO^ Templates 
[0265] One ng of lambda phage DNA contains 2 x lO'*' copies of template. The specificity of 
PAP is detennined by mixing one part mutant lad templates with IC^ to 10^ parts control DNA 
templates, e.g., wild-type lad. The specificity of PAP-A is a fiinction of the error rate of the 
polymerase, the purity of P* (<2 x 10"^ by current purification protocol) and the potential for 
damage of the DNA template in the extraction process. The yield and specificity of PAP is 
optimized by testing enzyme type and conceaitration and the concentrations of other 
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components, such as dNTP, PP,, Mg'^ or Mn""^. Hotstart PAP using andbody-activated enzyme, 
such as DNA polymeraso, at room temperature can be used to eliminate spurious ampHfications. 
[0266] Wild-type and mutant lambda phage DNA, which are used in the laboratory as a model 
system to study spontaneous mutation in mammals, are prepared from infected E. coli SCS-8 
5 cells (Nishino et al., 1996). The lambda phage is grown under high fidehty conditions amd DNA 
is isolated with care under conditions with low rates of DNA damage (Stratagene manual) 
(Nishnio et al., 1996; HiU et al., 1999). 

[0267j The mutants include one example of each of the two types of transitions, the four types 
of transversions and a one-base nucleotide deletion. P*s specific for each of the mutations is 

10 synthesized. These DNA templates' are used for reconstruction experiments in. which mutated 
DNA is serially diluted into wild-type DNA. The spiked samples are used to optimize PAP-A. 
The most nibust polymerases are chosen based on yield and specificity using TaqPS, 
ThennoSequeuase, and SequiTherm Excel II (Epicentre). Oflier components of the reaction are 
optimized systematically, including fhetmocycling parameters, oligonucleotide length, and 

15 reagent oonoentrations of PPj, dNTP and Mg"" or Mn^. Quantitative detection of the yield of 
PAP prodnct is achieved with autoradiography or fluorescence on a SSCP gel. These data aids m 
the optimization of PAP-R and LM-PAP (below). The optimization of these various parameters 
result in a specificity of 1 part in 10''-10^. 

[0268] The Gptunized conditions are also tested, for detecting mutations in the human fector IX 
20 gene by mixing human mutant genomic DNA tanplates with up to lO'* wild-type templates. As 
with the lambda experiment, exponential PAP is performed with appropriately designed 
oligonucleotides (using 01igo5 software) for 40 cycles and strong signal is achieved by 
autoradiography or by fluorescence detection. 

25 , EXAMPLES 

Optimization of PAP-R 

[0269] In a model system, mismatches along the length of P* inhibited activation, even when 
the mismatch is two nucleotides from the 5' end (Fig.l4). An additional set of 18 mers of P*s, 
whose 5' termini were displaced 2, 6, 9, and 12 nucleotides downstream, also showed inhibition 
30 of activation (Fig. 13). In addition, 20 and 22 mers also show inhibition with single nucleotide 
mismatches (Fig. 12). To extend these findings and to lay the foundation for a robust method of 
resequencing, the relationship between the location of single base mismatches and activation of 
P*s is analyzed fttrther. 
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[0270] The factor IX gene is used as a model system because more than 1,000 DNA samples 
from hemophilia patients and family members have been ascertained from previous work on the 
molecular epidemiology of germhne mutations in humans (Sommer, 1995; Ketterling et aL, 
1999). Two 20-nucleotide regions of exon B aad exon H in the human factor IX gene are used 
as model systems. The region of exon B is designed from nucleotides 6460 to 6479 (5' 
CGAGAAGTTTTTGAAAACAC 3' (SEQ ID NO:55;Yoshitake et al., 1985), within which eight 
different single base mutations aie available. The region of exon H is from nucleotides 30845 to 
30864 (5' GAACATACAGAGCAAAAGCG 3' (SEQ ID NO;56), wifiun which seven mutations 
at dijEferenl positions are available. P*s identical to wild-type regions B and H will be 
synthesized. Identical P*s are synlhesized, with the exception of a single nucleotide mismatch. 
[0271] The wild-type factor DC sequence is used in the initial studies. A few P*s that match the 
wild-tj^e sequeaice or that mismatch at selected sites within the 5' third of the oligonucleotide 
sequence are helpfiil in performing pilot experiments to assess the optimal length of the 
oligomicleotide. The effects of polymerases and reaction conditions can be assessed. 
[0272] Fioni preHrainary data, it appears that 18 mers or larger may be an optimal size. It is 
also possible that 25 mers or even 30 mers may be optimal. For the present example, it is 
assumed that 20 mers are an optimal size. Wild type P* and twenty P*s with one of the possible 
single base mismatches at each nucleotide of the position region of exon B are synthesized. 
Eight of these P* are a perfect match to a mutation in a patient with hemophilia B. As positive 
controls, it is shown that these P*s activate efficiently when the ^propriate mutated DNA 
sample is used. Exponential PAP and linear PAP are performed and the noise rate is determined. 
The noise rate for linear PAP is generally lower and is used 

[0273] To confirm preliminary data in another sequence context, a similar experiment is 
perfonned in exon H. The seven mutations ia that region of exon H are analyzed in a blinded 
manner to determine if the precise match is detected. The effects of the position of the mismatch 
or tlie type of mismatch on P* activation is determined. The effects of different polymerases, 
reaction temperature, and other reaction conditions can also be determined. Another set of 20 
P*s provides additional data from mismatches 12-20 nucleotides from the 3' terminus. 

EXAMPLE 8 
OptunizatioaiofLM-PAP . 
[0274] The human dopamine Di receptor gene and the mouse PgU gene are used as model 
systems to compare the analysis of chromatin stmcture when LM-PAP or LM-PCR is utilized 
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The dopamine Di receptor gene has been described above. X chromosome inactivation occurs at 
an eaiiy embryonic stage. Since the two alleles in female cells maintain a different expression 
status, this is an advantageous system for studies of gene regulation. PgM is an X-linked 
housekeeping gene encoding phosphoglycerdte kinase (PGK). PGK is an important enzyme in 
glycolysis and the gene is expected to be active all the time except in the inactive X 
chromosome (Xi) of fatnale somatic cells and in male germ cells. 

10275] The prehminaty data shows a dramatic enhancement of specificity with LM-PAP relative 
to LM-PCR in the dopamine Di receptor gene, a gene not previously analyzed for chromatin 
structure (Fig. 16 A). In this example, LM-PAP and LM-PCR are praformed. Three sets of 
oligomicleotides that generated LM-PCR profiles and seven sets of primers that generated LM- 
PCR profiles with unacceptable background in the Pgkl (and other X-chromosomal genes) are 
nsed to compare IM-PAP with LM-PCR. Deoxy-terminating and dideoxy-teaminadng 
oligomicleotides of identic^ sequence are utilized to perform LM-PAP and LM-PCR, 
respectively. The level of signal relative to background is also quantitated by a Phospholmager. 
The average signal-to-noise ratio is determined. Optimization data derived ftom analyses with 
PAP-A and PAP-R are also useful in the L^yi-PAP protocol. LM-PAP is optimized for the two 
regions to detennine if the signal-to-noise ratio can be reduced fifffher. 

EXAMPLE 8 
Optimization of AUele-Specific LM-PAP 
[0276] Polymorphic sites of pgkla and lb gene in both coding and non-coding regions have 
been reported (Boer et aL, 1990). These are used to design the allele-specific P*. One allele- 
specific oligonucleotide is chosen prospectively fiom the Pgkl gene and one is chosen 
prospectively ftom Ihe dopamine Di receptor gene. Blocked and unblocked ohgonucleotides of 
identical sequence are synthesized and allele-specific LM-PAP and LM-PCR are performed, 
respectively. The signal to noise ratio is quantitated and compared. 

EXAMPLE 9 
PAP-R on a Microaxray 

[0277] The uiitial experiment will focus on the two 20 nucleotide regions of exons B and H as 
described above. The experimental design of PAP-R is sknilar to the eacperiments described 
above, except for digital Hght-direct synthesis of P* oligonucleotides on a microarray, e.g., with 
the Geniom® instrument A total of 160 oligonucleotides are synthesized complementaiy to 
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wild-type and to all the single base mismatches for 20 bp regions of exons B and H of the factor 
IX gene. As a positive control, 160 oligonucleotides, each out of registered by one nucleotide, 
are synthesized to match exactly an adjacent 160 bp region of the factor IX gene. Genomic DNA 
from wild-type and mutant samples is amplified, annealed to the oligonucleotides and primer 
extension will be performed with a fluorescent dideoxy terminator. The protocol is optimized for 
the solid support. Adjustment of primer length, enzyme utilized and reaction conditions is 
performed such that most, if not all, of the ohgonucleotides that mismatch the two 20 bp 
nucleotide regions of factor DC generate little if any signal, -wdiile most of the 160 control 
oligonucleotides generate a strong sigtiaL 

[0278] One strategy for resequeacing is shown in Figs. 3 and 4. Each nucleotide in tiie 
complementary strand of the predetermined sequence is queried by four downstream P*s, such 
as 20 mers, which have identical sequence except for the 3' terminus, which is either ddA, ddT, 
ddG or ddC. For a 1 kb segmeart, 4,000 P*s are needed in the downstream direction. In the 
second set of experiments, exons B and H of the fector IX gene are resequenced. Samples &om 
more than 200 patien[ts with different mutations in these regions are available for analjrsis. False 
positives and false negatives are assessed by bhnded analysis. Heterozygous female samples are 
available for many of the mutations. For the remaming. male patient samples one to one mixing 
experiments with wild-type or a second mutated sample generates the equivalent of 
heterozygotes or compound heterozygotes, respectively. Subsequently, all the regions of lilcely 
functional significance (the putative promoter region, the coding regions, and the splice 
junctions) are resequenced (2.2 kb). Since more than 600 independent mutations are available, it 
is possible to determine whether more than 99% of all sequence changes are identified (the 
sequence changes in these samples have been determined by diiBct sequencing over the course 
of a decade). 

[0279] A P* with a single base substitution at the 3' terminus generates a signal at the position 
of hemizygous or homozygous point mutations. The mutation also ci eates a "gap" of no PAP 
signal, which spans a region of several successive nucleotides. When a single base substitution 
occurs, the g^ size (nucleotides) + 1 = die length of the 3' specific subsequence (Figs. 3 and 4). 
[0280] To analyze samples with higher G4C content (55%), mutations in the lad gene are 
utilized. These mutations from the Big Blue® Transgenic Mouse Mutation D^ection System, 
have the potential to fecilitate the definition of a strategy fliat detects more than 99.9% of 
mutations, since more than 6,000 mutations are available in this system. The relevant regions are 
analyzed with the help of robotic devices. In addition, hundreds of mutations or polymorphisms 
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are available for analysis in other genes with. G+C contents of 30-75%. The dystrophin gene is 
particularly amenable to testiag performance under conditions in which megabases of sequence 
require scanning. In this gene m which 90 segments are amplified by a robotic device, virbially 
all sequence variants have been defined by DOVAM-S followed by DNA sequencing. This is 
s advantageous because many molecular epidemiological and moiecular diagnostic applications 
besnefit firom resequeaicing that detects virtually 1 00% of the mutations. 

EXAMPLE 10 

PAP Amplification Directly from Human and Mouse Genomic DNAs 

10 [0281] PAP was performed with each of two P*s, PI* (SEQ ID NO:45, G allele specific)) or 
P2* (SEQ ID NO:47, A allele specific) aad aa upstream unblocked primer (U; (SEQ ID NO:48) 
to amplify 180-bp segments of the Di dopamine gesae. The P* are 26-mers with ddC and ddG at 
the 3' tennini. 100 ng of human genomic DNA was amplified 35 cycles followed by 2% gel 
electrophoresis. The PAP reactian mixture contained a total volume of 25 jil: 50 mM KCI, 20 

15 mM HEPES/NaOH (pH 6.9 at 25 "C), 10 mM CNH4)2S04, 1.5 mM MgCh, 40 nM each of the 
four dNTPs (dATP, dTTP, dGTP, dCTP), 0.1 ]iM U, 150 ^iM NajPpi, 2% DMSO, 0.5 U of 
AmphTij^FS polymerase (PE Applied Biosjretems), 0.5 U Taq polymerase and 1 00 ng of human 
genomic DNA. The cyclmg conditions were 94 "C for 15 sec, 65 "C for 30 sec and 72 "C for 1 
min. Fig. 17A shpws the results for PAP amplification of the Di dopamine gene. In lanes 2 and 

20 5, PI* is specific for the A allele template at 24 nucleotides from the 3' terminus, so there is 
little or no discrimination between the G/G and A/A genotypes. In lanes 3 and 6, P2* is specific 
for the A allele template at 2 nucleotides from the 3' terminus, so there is specific amplification 
of the A/A genotype. Lanes 1 and 5 are PGR controls. Lanes 4 and 8 are negative controls 
without P*. Lane M is 120 ng cpx DNA/HAEEV marker. 

25 [0282] Three Bi-PAP assays were tested directly from mouse genonnc DNA. Bi-PAP was 
performed with two P*s containing a dideoxyaucleotide blocker at the 3' terminus to ampHfy an 
80-bp segment of the lad gene. The P*s are specific to the wild-type template and are 40-42 
nucleotides long. In each of the tbree Bi-PAP assays, two opposite P* with one nucleotide 
overlap at their 3' termini were used to amplify 400 copies of the lad geae using 35 cycles.. 

30 The sequences of the P*s are as follows: 

5' GAAGCGGCGTCGAAGCCTGTAAAGCGGCGGTGCACAATCT* 3' (SEQ ID 
NO:67) and 5' GCGGATAGTTAATGATCAGCCCACTGACGCGTTGCGCGAGAA* 3' 
(SEQ ID NO:68) in lanes 1 and 2; 
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5' GATGGCGGAGCTGAATTACATTCCCAACCGCGTGGCACAA* 3' (SEQ ID 
NO:69 and 5' GGCAAC.GCCAATCAGCAACGACTGTTTGCCCGCCAGTTGT* (SEQ H) 
NO;70) ill lanes 3 and 4; and 

5' TACATTCCCAACCGCGTGGCACAACAACTGGCGGGCAAAC* 3' (SEQ ID 
N0:71) and 5' GGGCCAGACTGGAGGTGGCAACGCCAATCAGCAACGACTG* 3' (SEQ 
ID NO:72) in lanes 5 and 6. 

[0283] The PAP reaction mixture contained a total voliime of 25 nl: 50 niM KCl, 20 mM 
HEPES/NaOH (pH 6.9 at 25 °C), 10 mM (NH4)2S04, 1.5 mM MgCl2, 40 fiM each of the four 
dNTPs (dATP, dTTP, dGTP, dCTP), 0.1 ^iM U, 150 ^M Na^Pps, 4% DMSO, 1.0 U of 
AmpliragFS polymerase (PE Applied Biosystems) and 400 copies of mouse gencftnic DNA. 
The cycling conditions were 94 "C for 15 sec, 65 "C for 30 sec and 72 "C for 1 min. The 
unincorporated P*s were separated well from flie Bi-PAP product on 2% agarose gel. No dimer 
was seen. Fig. 17B shows the results for th^e Bl-PAP assays. In lanes 1, 3 and 5, the wild-type 
templates are amplified. Lanes 2, 4 and 6 are negative controls without mouse genomic DNA. 
[0284] Three PAP assays directly ampUfied 180-bp segments of flie DI receptor gisae j&om 
human genomic DNA with strong signals of PAP products. The allele-specificity of 26-mer P* 
remains when the mismatch is at 2 nucleotides from the 3' tecminus, but tiie allele-specificity is 
lost when ftie mismatch is at 24 nucleotide from the 3' terminus. Three Bi-PAP assays directly 
aiaqjlified as low as four hundred copies of Ihe lad gene from mouse genomic DNA. The P* 
oligonucleotides have different deoxjTiucIeotides blocked at the 3' terminus and all can be 
effideirtly activated. Addition of extra human DNA did not affect the amplification of the loci 
gene in mouse genomic DNA, The product of Bi-PAP was easily distinguished from 
unincorporated P*s. P* does not form dimmers because P* needs long and perfectly matched 
regions at the 3' tenhinus for activation. 

EXAMPLE II 
PAP with Acyclonucleotides and Various Polymerases 
X Phage DNA Template 
[0285] The wild-type k phage DNA tempMs that contains an inserted wild-type lad gene of ^. 
coli (Kohlea: et al., 1991) was purchased from Stratagene. The mutant X phage DNA template 
was prepared from X phage plaques transformed into SCS-8 E. coli cells according to Maniatis, 
et al. (1982). It contained a T to G mutation at nucleotide 369 in the lad gene. The amount ofX 
phage DNA was determined by UV absorbance at 260 mn. 
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Synthesis of P* by Adding Acyclonucleotide or a Dideoxyoucleotide at the 3' Terminus' 
[0286] The 3' tenninal acyclonucleotide or 3' terminal dideoxynucleotide was added to a 
deoxynucleotide oligonucleotide by termiaal transferase. The mixture contained a total volume 
of 25 p.1: 100 mM potassium cacodylate (pH 7.2), 2.0 mM C0CI2, 0.2 mM DTT, 2 nM of the 
oligonucleotide, 2.4 mM acycloNTP (the molar ratio of the 3 -OH temiinus to acycloNTP was 
1:30) (New England BioLabs), or 2.4 mM 2',3'-ddNTP (the molar ratio of the 3'-0H terminus to 
ddNTP was 1 :30)(R.oche), 100 U of terminal transferase (In\dtrogeii). The reaction, was 
incubated at 37 °C for 6 hr and then stopped by adding EDTA to a 5 mM final concentration. 
After desalting using a Centri-spin'^" column (Princeton Separations), P* was purified by 
preparative 7 M urea/18% polyacrylamide gel electrophoresis with 30 mM 
triefhaQolamines/tricine bufiffer (pR 7,9 at 25°C) (Maniatis, et al., 1982; Liu, et al., 1999b). The 
amount of recovered P* was determined by UV absorbance at 260 mn. 

[0287] Since small amounts of unterminated oHgoimcleotide would result in unexpected PCR 
anqjlifieatioii, the pntity of P* was tested by the absence of PCR product at pH 8.3 in which 
pyrophosphorolysis is inhibited. It is estimated that more than 99.99% of F* contained an 
acyclonucleotide or a dideoxynucleotide at the 3' terminus. 
PAP Amplification 

10288] PAP was examined with P*l and Ol, with P*2 and 02, and with P*l and P2* 
respectively (Fig, 18A and Table 6). The P*s were 30 or 35 nucleotide long and contained an 
acyclonucleotide or a dideoxynucleotide at tiie 3' terminus. 
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[0289] The PAP reaction mixture with AmpIira^FS DNA polymerase contained a total volume 
of 25 \ih 50 mM KCl, 20 mM HEPES/NaOH (pH 6.9 at 25 °C), 10 mM (NH4)2S04, 1.5 roM 
MgCl2, 50 jiM each of the four dNTPs (dATP, dXTP, dGIP and dClP), 0.1 pM of each 
oligonucleotide, 150 Na4PPi, 4% DMSO, 1 U of AmpHTb^FS DNA polymerase (PE 
Applied Biosystems), 0.1 ng of the phage DNA template. The cycling conditions were 92 °C 
for 10 ssCi 65 °C for 30 sec, and 72 °C for 1 min for a total of 30 cycles, A daiaturing step of 92 
°C for 1 min was added before the first cycle. 

[0290] The PAP reaction mixture with Vent (exo-) or P/U (exo-) contained a total vokime of 
25 nl: 10 mM KCl, 20 mM HEPES/NaOH (pH 7.19 at 25 °C), 10 mM (NEl4)2S03, 1.2 mM 
MgCl2, 50 pM each of the four dNTPs (dATP, dTTP, dGTP and dCTP), 0.1 pM of each 
oligonucleotide, 150 ]M Na4PPi, 4% DMSO, 1 U of Vent (exo-) DNA polymerase (New 
England BioLabs) or Pfu (exo-) DNA polymerase (Shratagene), 0.1 ng of the X phage DNA 
template. The cycling conditions were 94 °C for 15 sec, 60 °C for 30 sec, and 72 "^C for 1 min 
for a total of 30 cycles. A denaturing step of 94 °C for 1 min was added before the first cycle. 
[0291] The product was electrophoresed tlirough a standard 2% agarose gel. The gel was stained 
with ethidium bromide for UV photography by a CCD camera (Bio-Rad Gel Doc 1000). 
[0292] As shown above, TaqFS, a genetically engineered DNA polymerase (Tunis and Gelfand, 
1999), greatly improved the efaciency of PAP. 3' terminal dideoxynucleotide blocked P*s can 
be activated by pyrophosphoroiysis to remove the 3' terminal dideoxynucleotide in the presence 
of pyrophosphate (PPj) and the complementary strand of the allelic template. Then the activated 
P* can be extended by DNA polymerization. 

[0293] PAP performed with 3' acyclonudeotide blocked P*s by using X phage DNA 
contaming the lad gene as model system. P*l and P*2 are downstream and upstream blocked 
oligonucleotides, respectively, for the same mutation (Fig. 18A and Table 6). The P*l and P*2 
have an acycloGMP and acycloCMP at their 3' tesmini, respectively. Amplification products 
were absent without pyrophosphate added at pH 8.3 where pyrophosphoroiysis is inhibited, 
showing that P*l and P*2 were not directly extendible. 

[0294] P*l and P*2 are specific to the mutated template but mismatch to the wild-type tenq)Iate 
at their 3' tenninL The mutated template was amplified efSciently by PAP with one 
acyclonudeotide blocked P* and one opposing unblocked ohgomicleotide and by PAP with two 
opposing 3' terminal acyclonudeotide blocked P*s (lanes 1 and 2 in Fig. 18B), with two 
opposing acyclonudeotide blocked P*s (a spedal form of PAP wbexe the two opposing P*s are 
overlapped at their 3' temaini by one nucleotide) (Land 3 in Fig. 18B). However, no product was 
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generated from the wild-type template because of tiie rmsmatch at the 3' terminus, showing the 
specificity (lanes 5-1 m. Fig. 18B). PAP with the 3' dideoxynucleotide blocked P* showed 
similar results (Issies 946 in Fig. 18B). Direct sequencing analysis confiiiued the correct 
sequence of the amplified product. The effect of P* length was also tested. Similar results were 
obtained witti 35-mer P*s that are co-terminal with the 30-mer P*s and five nucleotides longer at 
their 5' tenrani (Fig. 18C). Other P*s specific for the wild-type sequence at the 3' terminus 
(with acycloTMP and ddTMP) were also tested with similar results. 

[0295] Family E DNA polymerases Vent (exo-) and Pfii (exo-) were tested using the above 
model system. With the acyclonucleotide blockeir and ^perfect match at the 3' temaions, the 
mutated template was arapMed efficiently by PAP with one P* (lanes 1 aad 2 in Figs. 18D and 
18B) and one cpposang unblocked oligonucleotide and by PAP with two opjiosing P"^ of P*l 
and P*2 (a special form of PAP where the two opposing P*s are overlapped at their 3' termini by 
one nucleotide) (lane 3 in Figs. 18D and 18E). However, no product was generated fcom the 
wild-type template becaiKe P*l and P*2 mismatch the wild-type template at their 3' termini, 
showing the specificity (lanes 5-7 in Figs. 18D and 18E). Vent (exo-) and Pfii (exo-) 
polymerases could not ainplify with the 3' dideoxynucleotide blocked P* (laaes 9-16 in Figs. 
18D and 18E). Dii:e<3t sequencing analysis conJBrmed the correct sequence of the P*l/01 and 
P*2yO 2 pripducts. Similar results were obtained with AcycloPol (PerMn-Ehner), a genetically 
engineered Family E archeon DNA polymerase. It is not clear why PAP with Vent (eso-) and 
Pfu (exo-) DNA polymerases discriminates against 3' dideoxyribonucleotide blockers. 
Other Blockers 

[0296] These results demonstrate that two terminators used in Sanger sequencing can be used as 
blockers in PAP. Terminators have also been described as therapies of viral illnesseg, such as 
AIDS, and for cancer therapy, such as, S'-deoxyadenosine (cordycepin), 3'-azido-3'- 
deoxyUiymidme (AZT), 2',3'-dideoxyinosine (ddl), 2',3'-dideoxy-3'-thiacytidine (3TC) and 2',3'- 
didehydro-2',3'-dideoxythynudine (d4T). DNA poljmerase can incorporate their triphosphate 
form into the synthesizing strand, and the iacorporation cause termination of the extension 
(Gardner and Jaclc, 1999; Cheng et al., 1987; St Clair et al, 1987; Ueno and Mitsuya, 1997). 
The monophosphate nucleotides of 3'-azido-3'-deoxythymidine (AZT), 2',3'-dideoxy-3'- 
tliiacytidiae (3TC) and 2',3'-didehydro-2',3'-dideoxyfhymidine (d4T), when located at the 3' 
teamini of oHgouucIeotides, can be removed by pyrophosphorolysis by HIV reverse transcriptase 
or its variants (Anon et al., 1998; Gotte et al., 2000; Meyer et al, 2000; Urban et al., 2001). 
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These results indicate the application of PAP for various types of blockers and for RNA 
templates. 

[0297] In summary, PAP amplification occurred efficiently and specifically with 3' 
acyclonucleotide and 3' dideoxytiucleotide blockers using TaqFS DNA polymerase, and only 
5 with acyclonucleotide blockers using Vent (exo-) and Pfu (exo-) DNA polymerases. Other 3' 
terminal nonextendible oligonucleotides and other DNA polymerases can be used, if the 3' 
terminal nucleotide can be removed by pyrophosphorolysis, and the activated oligonucleotide 
can be extended. 

10 EXAMPLE 12 

Detection of Extremely Rare Alleles by Bi-PAP 
A, Phage DNA Template 

[0298] The wild-type X phage DNA template that contains an inserted, wild-type lad geaie of E. 
coli (Kohler et ai., 1991) was purchased fiom Stratagene. Three mutated X phage DNA 

15 templates were prepared from X phage plaques transfomied into SCS-8 E. coli cells according 
to Maniatis et al ( 1982) . They contain, an A to T mutation at nucleotide position 190, a T to G 
mutation at nucleotide 369 and a T to C mutation at nucleotide 369 in the lad gene, 
respectively. The amount of />, phage DNA was determined by UV absorbance at 260 nm. 
Synthesis of P* by Adding a 3' Dideoxynucleotide 

2 0 [0299] The 3' tenmnai dideoxynucleotide was added to an oUgodeoxynucleotide by termiaal 
transferase. The mixture contained a total volume of 25 jil: 100 mM potassium cacodylate (pH 
7.2), 2.0 mM C0CI2, 0.2 mM DTT, 2 nM of the oligonucleotide, 2.4 mM 2\ 3'-ddNTP (the 
molar ratio of the 3'-0H temoinus to ddNTP was l:30)(Roche), 100 U of terminal transferase 
(Invitrogen). The reaction was incubated at 37 °C for 6 hr and then stopped by adding H3TA to 

25 a 5 mM final concentration. After desalting using a Centri-spiu"^*' column (Princeton 
Separations), P* was purified by preparative 7 M ureayi6% polyacrykmide gel electrophoresis 
wifli 30 raM Triethanolamine/Tricine buffer (pH 7.9 at 25''C) (Jitoiiatis et al . , 1982, Lm 
et al., 1999b). The amount of recovered P* was detenniaed by UV ait^orbance at 260 mn. 
[0300] Since small amounts of untenninaled oligonucleotide would result in imexpected PGR 

30 amplification, P* was ^?P-labeled at the 5' terminus by T4 polynucleotide kinase and theai was 
electrophoresed through a 7 M urea/20% polyacrylaraide gek Only P* products were visible 
even when the gel was overexposed. It is estimated that more than 99.99% of P* contained a 
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dideoxynucleotide at the 3' terminus. The purity of P* was siqjported by the absence of PGR 
product at pH 8.3 in which pyrophosphorolysis is inhibited. 
PAP Amplification 

[0301] Bi-PAP assays for nucleotide 190 and nucleotide 369 of the /acTgene Were examined. 
The P*s were 40 nucleotides long except that the upstream P*s for position 369 are 42 
nucleotides. Each P* contained the sequence-specific nucleotide at the 3' tenninns. The PAP 
reaction mixture contained a total volume of 25 fil: 50 mM KCl, 20 DQM.HEPES/NaOH (pH 6.9 
at 25° C), 10 mM (1014)2804, 1.5 mM MgCU, 40 (jM each of the four dNTPs (dATP, dTIP, 
dGT? and dCTP), 0.1 ]iM each P*, 150 yM Na+PPi, 4% jmSO, I fiCi of [o-^^HCTP 
C3000Ci/namole, Amersham), 1 U of AmpUJagFS DNA polymerase (PE Applied Biosystems), 
2,000 copies of the X phage DNA template or stated elsewhere. The cycling conditions were 
92° C for 6 sec, 68° C for 20 sec/aad 72*' C for 20 sec for a total of 35 cycles. A denaturing step 
of 92°C for 1 min was added before the first cycle. 

10302] The product was . electrophbresed through a standard 2.5% agarose gel, and the gel was 
stained with ethidium bromide for UV photography by a CCD camera (Bio-Rad Gel Doc 1000). 
[0303] In order to differentiate the mutated product from the wild-type product of the same size, 
non-denaturing SSCP gel eleelnophoresis was performed (Orita et al., 1989). The reaction was 
mixed wilt two-fold volume of loading bufier (7M urea and 50% formamide), boiled and 
rapidly cooled on ice. The product in 10 \il of the mixed reaction was electrophoresed through 
an 8% non-denaturing PAGE-PLUS (Amresco) gel witli 30 mM Ethanoiamine/Capscp buffer 
(pH 9.6) (Liu et al., 1999b) at 4°C. The gel was dried and exposed to Kodak X-QMAT™ AR 
film for autoradiography. Three or four bands from each amplified product were seen on a gel. 
The upper one or two bands were double sfrained DNA due to hybridization of de-natuied 
single-stranded segments during the electrophoresis as a result of the substantial amounts of 
amplified product present, hicreasing the concentration of the amplified product fiirfhea: increase 
the intensity of the upper bands. 

Highly Efficient PAP Amplification 
[0304] TaqFS, a genetically engineered DNA polymerase greatly improved Uie efSciency of 
PAP. The conditions of PAP were further optimized for diamaiicaaiy higher efBciencies 
allowing PAP to amplify dnrectly fiiom a few copies of X phage DNA or human genomic DNA 
template. The reaction components and the thermocycling regime were optimized, includmg: i) 
deoreased conoentrations of PPi in that keeping the PPi to dNTP ratio essentially constant, ii) 
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use of low pH HEPES buffer (pH 5.9 at 25" C), iii) addition of (1SIH4)2S03, iv) increased amount 
of TaqFS, and v) higher aimea3ing temperature. 
Bi-PAP 

I030S] PAP has a potential selectivity of 3.3xl0":l (Fig. 19). Approaching this potential 
5. requires a design tliat eliminates confounding sources of error. The A190T mutation of ttie lad 
gene of A, DNA is used las a niodel system. In PAP wilii one, downstream P* and one upstream 
unblocked oligonucleotide, extension errors ftom the non-blocked upstream oligonucleotide can 
produce tiie rare mutation of ioterest, tlius reducing the selectivity. If the nddsincorpoTation rate 
of TagVS is 10"^ per incorporated nucleotide and one of the three possible misincorporations 
1 0 generate the A->T mutadon on the newly synthesized upstream strand, the selectivity decreases 
to 3.3x10"^ due to the side effect. In order to retaove this limitation, Bi-PAP was developed 
(Fig, 20A). In Bi-PAP, botti die downstream and upstream oligonucleotides are P*s that are 
specific for the nucleotide of interest at their 3' termini. The P*s overlap at their 3' termini by 
one nucleotide. 

15 [0306] Bi-PAP amplified efficiently and specifically at nucleotide positiim 190 using X phage 
DNA containing the lad gene as template (Fig. 20B). Addition of human genomic DNA did not 
affect the amplification. The 79-bp product of Bi-PAP was easily distinguished fi-om 
imincorporated P*s. P* did not form dimers because P* needs a perfectly matched region at the 
3' terminus for activation., Similar results were observed at nucleotide position 369. Direct 

2 0 sequencing analysis confiimed tlie correct sequence of tlie amplified product. 

Sensitivity and Selectivity of Bi-PAP 
[0307] In order to demonstrate the extremely high selectivity of Bi-PAP, more tlian 10^° copies 
of DNA template was used for a Bi-PAP reaction. X DNA containing the lad geae of E.coli 
was chosm as the model sjretem because 1 fig of A, DNA contains 2x10'*^ vector genomes, while 
25 l(ig of humaii geabmic DNA only oontaias 3370.0^ genomes. In ordcar to avoid pfotenttal 
contamination of the wild-type X DNA in this laboratory, mutation-specific Bi-PAP assays with 
mtitated P*s were chosen to amplify the wild-type X DNA The relative frequeaicy of a 
spontaneous mutation of the tad gene in the wild-type X DNA is estimated to be less than 10'' 
by examining X phage plaques infecting E. coU. 

3 0 [0308] The sensitivity and selectivity of Bi-PAP were examined using three mutation-specific 

Bi-PAP assays with their corresponding mutated X DNA (see Table 7 footnotes for definitions). 
Four titration experiments were perfonned for each mutation-specific Bi-PAP assay (Figs. 21A- 
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21C). Experiment I tested how much, the mutated P* can "tolerate" the wild-type DNA template 
(z.e., the maximum copies of the wild-type template without a detectable mntated product). The 
wild-type X DNA was titrated from 2x10^° copies to 2x10^ copies. The maximum tolerances 
WCTe 2x10^ to 2x10'", 2xlO' to 2x 10^ and 2x10^ to 2x 10^ respectively, for the three mutation 
5 specific Bi-PAP assays, respectively (Figs. 21A-21C). Experiment n tested the sensitivity of Bi- 
PAP. The mutated k DNA was titrated &om 2x10^ to 0 copies. The ratio of the maximum 
tolerance (Experiment I) to the sensitivity is the selectivity. Ejiperimemt 11 repeated in the 
preseooe of large amount of wild-tj^pe tencqjkte (Experiment IE) or large anounts of human 
genomic DNA (Experiment IV) without effects (Fig. 21A; data not shown for T369G and 
10 T369C). A dose response with template copy number was obsesrved. 



TABLE? 

Summary of the three mutation-specific Bi-PAP assays" 



Assay 


Position" 


Type^ 


Sensitivity'' 


Seledivitity" 


A 


190 


A;T-^T:A 


2 


10':1 to 10'";1 


B 


369 


T:A->G:C 


2 


lO'iltoltf^rl 


C 


369 


T:A-^C:G 


2 


10':1 to IQ":! 



X5 "hi each of the three mutation-specific Bi-PAP assays, two opposite P*s with one nucleotide overlap at 
their 3' termini were used, , The P*s are 40-42 nucleotides long. They . are 
5'GATGGCGGAGCTGAATTACATTCCCAACCGCGTGGCACAT* (SEQ ID N0;61) and 
5' GGCAACGCCAATCAGCAACGACTGTTTGCCCGCCAGTTGA* (SEQ ID NO:62) in Assay A; 
5'GAAGCGGCGTCGAAGCCTGTAAAGCGGCGGTGCACAATCG* (SEQ ID NO:63) and 

20 5' GCGGATAGrTAATGATCAGCCCAC TGACGCGTTGCGCGAGAC* (SEQ ID NO:64) in Assay 
B; 5'GAAGCGGCGTCXjAAGCCrrGTAAAGOGGOGGTGCACAATCC* (SEQ ID NO:65) and 
5' GCGGATAGTTAATGATCAGCCCACTGACGCGTTGCGCGAG AG* (SEQ ID NO:66) in Assay 
C. 

The position of the first nucleotide of transcript in the lacT gene is assigned the nucleotide position 1 
2 5 (Farabaugh, 1978). The 3' nucleotide of the P* is located at tiie indicated position and is complementeiy 
to the corresponding mutation. 

" The sensitivity is defraed as the minimum copies of the mutated template jfrom which a detectable 
mutated product is generated when a mutation^ecific Bi-PAP assay is used. It was detemiiaed by 
Experiment n (Figs. 21A-21C). 
30 ^ TTie selectivity is the ratio of the maximum copies of the wild-type template with undetectable product 
the minimum copies of (he mutated template with detectable product to, when a mutation-specific Bi- 
PAP assay is used. 



[03091 The j^proximately 100-fold difference in selectivity between the nucleptide positions 
35 190 and 369 may derive from: i) the presence of spontaneous mutations at the position 360 at a 
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frequency of 10"' to 10"^ in the wild-type X DNA, ii) impurity of P* oligonucleotides, iii) 
specificity of pyrophospliorolysis for a perfect match at the 3' temunus and fidelity of DNA 
polymerase to incorporate a correct nucleotide may be associated witli sequence context such 
ttat the Type H non-specific amplification occurs at a fiiequency of 10"' to 10^. In the latter 
case, a 100-fold difference in selectivity could arise fix>m a 10-fold differeace in 
pyrophosphorolysis specificity and a 10-fbld difTerence in DNA polynierase fidelity Tvifh 
sequence context 

[0310] The rate of a spontaneous mutation of A, phage in E. coli varies firom locus to locus, on 
the average ficom 10"^ to 10"" per incorporated nucleotide. The antpUfied signal seen in 
Experiment I might be caused by rare spontaneous mutations. 

[0311] There is a possible side reaction due to Ihe impurity of P* contamination of unblocked 
oligonucleotide where the dideoxy terminus has not been added, although no unblocked 
oligonucleotide was detected in the P*. However, this selectivity may not be limited severely by 
small amounts of unblocked oligonucleotide because the product generated would be much 
more hkely to be the wUd-type rather than Ihe specific mutation (3.3x10^:1). 
[0312] In summary, Bi-PAP has extremely high sensitivity and selectivity. Bi-PAP can 
selectively detect two copies of raie mutated allele with a single base substitution firom up to 
2x1 0' copies of the wild-tj^e allele. Bi-PAP is a simple, r^id, automatable method for 
detectibog any rare allele of interest. 

EXAMPLE 13 

Measurement of Mutation Load in Mouse Tissues by Bi-PAP 
Materials and Methods 

[0313] Liver, heart, adipose tissue, cerebrum and cerebellum from 10-day to 25-month old mice 
were snap frozen and stored under liquid nitrogen until used. DNA w£^ extracted according to 
the Big Blue protocol (Stratagene instruction manual). In brief, tissues were homogenized and 
digested witli proteinase K. The genomic DNA was extracted with phenol/chlorofoim and 
precipitated with ethanol. The DNA was dissolved in TE buffer (10 mM Tris/HCl, 1 mM 
EDTA, pH 8.0) and stored at 4 °C. The amount of the mouse geatiomic DNA was determined by 
UV absorbance at 260 nm. 

[0314] The mutation-specific Bi-PAP assay for T369G (Assay B: the two opposite P*s are 
dideoxynucleotide blocked with one nucleotide overlap at then 3' termini are: 
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5'GAAQCGGCGTCGAAGCCTGTAAAGCGGCGGTGCACAATCG*3' (SEQ ID 
NO:63) and 

5' GCGGATAGTTAATGATCAGCCCACTGACGCGTTGCGCGAGAC*3" (SEQ ID 

NO:64) 

of the lad gene was perfonned as above except that i) the reaction contained 2 ug of the mouse 
genomic DNA (~20 M> in si2B), unless oHierwise stated; ii) mouse DNA in 20 iil (1^5x HEPES 
buffer, 5% DMSO without MgCl2) was heated at 100°C for 2 min arid quickly cooled on ice, 
before Ihe other coo^onents added; iii) a deaiaturing step at 95''C for 1 min was added before the 
first cycle; iv) the denaturing step was 95°C for 10 sec. 

[03151 10 TJ.I of the 25 lal reaction was mixed with 10 jil of the denaturing loading buffer, boiled 
and r^idly cooled on ice. The product was electrophoresed through a 8% 7M urea /PAGE gel 
with 90 mM TBE buffeo: at room ten^jeratiire. The gel was dried and exposed to Kodak X- 
OMAT™ AR fihn for autoradiography. 
Results and Discussion 

10316] Transgenic mouse mutation detection systems permit detemiination of the frequency and 
pattern of spontaneous or induced mutations in vivo. The Big Blue® system uses transgenic mice 
harboring chromosomally-integrated X phage DNA containing the E. coli lad gene as the 
mutational target (Grossen and Vijg, 1993; Gossen et al., 1989; KoMer et al, 1990. The lad 
gene is integrated within each mouse diploid genome in 40 tandemly repeated X DNAs. 
[0317] The Big Blue® mutation detection system assay is performed by isolating genomic DNA 
from transgenic mouse tissues and mixing it with X packaging extracts. The packaged X phage 
can infect E. coli. ha the presence of X-gal substrate, /crc/mutants give rise 1:o blue plaques on a 
background of colorless \vild-type plaques. Ob.served mutants derive overwhelmingly from the 
mouse (Hill et al., 1999). The mutant frequency is determined by dividing the number of circular 
blue plaques by the total number of plaques. Of 5000 sequenced mutant plaques, 31 T369G 
mutants have been found in a total of 149x10^ plaques screened from various ages, genders and 
treatments in this laboratory (frequency - 2.1x10"')- 

[0318] To assess the utility of Bi-PAP for measuring rdtra-rare mutations in mammalian cells, 
the T369G mutation was analyzed in genomic DNA from the Big Blue mice. Two of mouse 
genomic DNA was amplified in 25 ul reaction containmg a total of 1.2x10^ copies of the lad 
gene. The mutation-specific Bi-PAP assay for T369G (Assay B) was performed for 18 sao^les 
in duplicate (Fig. 26A). Three categories of results were defined, each witii similar number of 
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samples: 1) six samples were positive two times (5, 11-15), 2) seven samples were positive one 
time (1, 3, 6, 9, 16-18), and 3) five samples were negative two times (2, 4, 7, 8, 10). 
[0319] Two samples in each category were studied fiirtlier (Figs. 26B-26C, Table 8). In category 
1, for the two samples 5 and 12 witli the strongest amplified signals (Fig. 26A), a four-fold 
dilution to 0.5 ]ig and 16-fold dilution to 0.125 ]ig of mouse genomic DNA were perfonned fer 
fMier quantitation (Fig. 26B). The T369G mutant frequency for each sample was estimated 
and varies 370-fold among the six samples (Table 8). The avesrage T369G mutant fi^jueney of 
2.9x10"' was within 50% of the average T369G mutant frequency of 2.1 xlO"^ measured from 
4x10' plaques using the Big Blue® mutation detection system and confirmed by direct 
sequencing. 



TABLES 



Somatic mutant frequesncy measured by Bi-PAP 





Sample" 


Mouse genomic DNA 


Frequency of positive amplification" 


Estimated 

mutant 
frequency^ 






Tissue 


Age 


2ug of 
DNA 


0.5 ug of 
DNA 


0.125 ]ig of 
DNA 




1 


12 


Adipoise 


6 months 


8/8 


8/8 


4/8 (0.69)'= 


9.25x10-' 


2 


5 


Liver 


25 months 


8/8 


7/8 


5/8 (0.9S) 


i-aixio"" 


3 


3 


Liver 


25 monttis 


8/24 (0.41) 






3.38x10-" 


4 


9 


Liver 


25 months 


13/24 (0.78) 






6.50xlO^\ 


5 


7 


Liver 


25 months 


2/24 (0.09) 






7.25x10-" 


6 


10 


Liver 


25 months 


1/24 (0.04) 






3.52x10-' 


Average 

" Bee Fie. 26 A. 












2.91x10"' 



the ratio of the number of positive signals for the T369G mutation relative to the total number of 
reactions. 



flie avea-age number of T369G mutants per reaction is estimated using a formula (the frequency of zero 
mutants per reaction = e"", x is the average number of mutants per reaction) suppose that the mutant 
distributes m the reaction according to a Poisson distribution and that if one or more mutants are in the 
reaction, the amphfication is positive, and if zero mutant is m the reaction, it is negative. 

the frequency of the T369G mutant of the laci gene in mouse genome per reaction is estimated 
assmnmg that the mutant distri butes in mouse genomic DNA according to a Poisson distribution and that 
one or more mutants are positive in the detection. For each of samples 12 and 5, a total of -6.0x10*, 
copies of the tocf gene are used for the estimate, and for each of samples 3, 9, 7 and 10, ~2 9x10* copies 
are used assummg that 2 pg of the lact mouse genomic DNA contans -1.2x10^ copies of Ihe lad gpae 
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[0320] The 370-fold variation in mutant frequency was observed in livers of five mice at 25 
months of age. This large variation could be due to difficulties in amplifying one copy of the 
template. To address tHs issue, eacli of the analyses was repeated at least two times with similar 
results. For example, in sample 9, seven of 14 reactions with 2 ug of DNA were positive in one 
experiment, three of four such reactions were positive in another experiment, and two of four 
such reactions were positive in a third experimmt For sample 7, there was one positive in eight 
and one positive in 14 reactions. The product was sequenced to confeia the T369G mutation 
after re-anqjlification from the positive reaction. In addition, positive controls (2 ug of the lad* 
mouse DNA with --10 copies of T369G) and negative controls (mouse genomic DNA wittiout 
the lad target, Le., the tec/ " mouse DNA) were perfomed. As additional positive controls, 
reconstruction experiments were performed in tiiat the copy number of the mutated X DNA per 
reaction was s^ally diluted by two-fold in the presence of the /ocT genomic DNA carrier. 
R^roducible an^Iifications from as low as one copy of template were demonstrated {Figs. 26B, 
26C). 

[0321] In retrospect, the 370-foid variation in the frequency of T369G mutant observed among 
the six mice may not be surprising because tiie T369G mutant fiequency among mice is over 
dispersed, implying a hyper-Poisson distribution (Nishino et al., 1996; Piegorsch et al., 1994). 
Among six mice the inter-animal variation in the overall mutant frequency assayed fay the Big 
Blue® mutation detection system might be 3 to 4 fold, witti significant founder effects in one or 
a few of the mice. The variation might be in the range of 2x10"^ to 8x10'^ which is the sum of 
more than 1,000 different mutatioDS. Here, only the T369G mutation is assayed. It is anticipated 
that the great majority of the signal derives from duplex mutated templates (Hill et al., 1999), 
but it should be noted that unresolved mismatch intermediates derived primarily from DNA 
replication or DNA repair would also generate a signal Thus, the physical limit of sensitivity is 
actually one half of a duplex DNA molecule per reaction. 

[0322] In conclusion, we demonstrate that Bi-PAP can analyze ultra-rare mutatioiis at 
frequencies as low as 10"'' to 10^^ depending on die assay. It is shown that Bi-PAP can detect 
single copies of the somatic mutation directiy from mammahan genomic DNA. The inter-assay 
variation may reflect locus-specific variability in the assay sensitivity or in the frequency of the 
assayed mutants among the samples. More work is necessary to distinguish betweetn these 
possibilities. In mammalian DNA, the number of copies of template is limited by the enomous 
genome size. Two ug of genomic DNA contaios only 600,000 mouse haploid genomes, yet the 
reaction is viscous. Our analysis of tiie Big Blue mouse genomic DNA was faciHtated by tiie 20 
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copies of the lad gene per haploid genome. To measure mutation load in humans, genomic 
DNA JB one reaction could be increased at least tliree fold by reducing the viscosity {e.g., 
shearing the DNA into small segments by ultrasonic treatment) and another four fold by 
expanding the reaction volume to 100 ul. Mutation load in human genomic DNA might be 
facilitated by analyzing segments of virtually identical sequence, e,g., there are three 9.6 kb 
segments with 99Vo sequence identity on human X chromosome involved in a common 
inversion mutation in hemophilia A (Laldch et al., 1993). Less complex genomes including C- 
elegans, Drosophila, and hmnan mitochondria genome or chrome viral infections (e.g., hepatitm 
B) also should be analyzable with this protocoL 

[0323] While Ihe mvention has been disclosed in this pateat application by reference to the 
details of preferred embodnneaits of tiae invention, it is to be understood that the disclosure is 
intended in an illustrative rtfher IJian in a limiting sense, as it is contemplated that modificatioiis. 
will readily occur to those skilled m the art, withui the spirit of the invention and the scope of 
the appended claims. 
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WHAT IS CLAMED IS: 



1 . A method of detecting a nucleic acid which, comprises : 

(a) annealing to a nucleic acid a first oligonucleotide V*, wherein the jSrst 
oligonucleotide P* has a non-extendable 3' end, wherein the 3 'non-extendable terrmnus 
of first oligonucleotide P* is removable by pyrophosphorolysis; 

(b) removing the 3' non-ejctendable terminus of the first oligonucleotide P* by 
pyrophosphorolysis to produce an unblocked first oligonucleotide; 

(c) extending the unblocked first oligonucleotide; 

(d) detecting -flie presence of the nucleic acid by detectmg the extended fiist 
oligonucleotide. 

2. The method of claim 1, wherein pyrDphosphorolysis is peifoimed with a nucleic acid 
polymerase. 

3 . The meliiod of claim 1 , wherein extension is performed with a nucleic acid polymerase. 

4. The method of claim 1, wherem a nucleotide or nucleotide analog present in the 
extension sbep contains a label and the presence of a label in the extended first 
oligonucleotide is detected. 

5 . The method of claim 1 , wherein steps (a), (b) and (c) stq repeated. 

6. The method of claim 1, wherein the 3 'non-extendable terminus of first oligonucleotide 
P* is a nucleotide or nucleotide analog which cannot be extended by a nucleic acid 
polymerase but which can be removed by pyrophosphorolysis. 

7. The method of claim 6, wherein the nucleotide or nucleotide analog is selected from the 
gi-oup consisting of a 3'deoxynucleotide, a 2',3'-dideox3Tiucleotide, an acyclonucleotide, 
3'-deoxyadenosine (cordycepin), 3'-azido-3'-deoxyfhymidlne (AZT), 2',3'-dideoxyinosin6 
(ddl), 2',3'-dideoxy-3'-thiacytidine (3TC) and 2'^'-dideihydro-2',3'-dideoxyl3iyimdine 
(d4T). 
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8. The method of claim 1, wherein a second oligonucleotide is annealed to a 
complementary strand of the nucleic add. 

9. The method of claim 8, wherein the second oligonucleotide is an unblocked second 
oligonucleotide. 

10. The method of claim 9, wherein the second oligonucleotide is extended ia step (c). 

11. The method of claim 9, wherein the second oligonucleotide is a second oHgonucleotide 
P* which has a non-extendable 3* end, vsiierein the 3* non-extendable terminus of the 
second ohgonucleotide P* is removable by pyrophosphorolysis. 

12. The method of claim 11, wherein the 3' non-extendable terminus of 1iie second 
oligomulceotide P* .is removed by pyrophosphorolysis m step (b) to produce aa 
unblocked second oligonucleotide and wherein Ihe unblocked second, olignonucleotide is 
extended in step (c). 

13. The method of claim 11, wherein the S'non-exteadable terminus of the second 
oligonucleotide P* is a nucleotide or nucleotide analog which cannot be extended by a 
nucleic acid polymerase but which can be removed by pyrophosphorolysis. 

14. The method of claim 13, wherein the nucleotide or nucleotide analog is selected irom tlie 
group consisting of a 3'deoxynucleotide, a 2',3'-dideoxynucleotide, an acyclonucleotide, 
3'-deoxyadenosine (cordycepin),.3'-azid6-3'-d6oxythynudhie (AZT), 2',3'-dideoxyinosiae 
(ddl), 2',3"-didepxy-3'-tfaiajcytidine (3TC) and 2',3'-didehydro-2'3'-dideox5j&yraidine 
(d4T). 

15. The method of claim 1 1 , wherein the 3 'non-extendable terminus of first oligonucleotide 
P* is a nucleotide or nucleotide analog which cannot be extended by a nucleic acid 
polymerase but which can be removed by pyrophosphorolysis and wherein the 3 'non- 
extendable terminus of the second oligonucleotide P* k a nucleotide or nucleotide 
analog which cannot be extended by a nucleic acid polymerase but which can be 
removed by pyrophosphorolysis. 
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16. The meliiod of claim 15, wherein the nucleotide or nucleotide analog is selected from tlie 
group consisting of a 3'deoxynucleotide, a 2'3'-dideoxyiiucleotide, an acyclonucleotide, 
S'-deoxyadenosine (cordycepln), 3'-azido-3'-deoxyfhymidine (AZT), 2',3^dideoxyinosine 
(ddT). 2',3'-dideoxy-3'-thiacytidine (3TC) and 2',3'-didehydro-2',3'-dideoxythyimditte 
(d4T). 

17. A method of synthesiziiig a nucleic acid which comprises : 

(a) annealing to a nucleic add a first oligonucleotide P* which has a non- 
ejcteudable 3' end, wherein the 3' non-extendable tertninus of the Grst oligonucleotide P* 
is removable by pyrophosphorolysis; 

(b) removing the 3' non-eostesadable terminus of the first oligonucleotide P* 
annealed to the nucleic acid by pyrophosphorolysis; and 

(c) extending the unblocked first oligonucleotide using a nucleic acid 
polymerase. 

18. The method of claim 1 7, wherein steps (a) to (c) wee repeated. 

19. The method of daim 17, whsrein pyrophosphorolysis and extension are perfonned with 
a nucleic acid polymerase. 

20. The method of claim 17, wherein a second ohgonucleotide is annealed to a nucleic acid 
strand which is the complement of said first strand. 

21. The method of clahn 20, wherem the second ohgonucleotide is an unblocked second 
ohgonucleotide. 

22. The method of claim 21 , wherein the second ohgonucleotide is extended in step (c). 



23. 



The method of claim 22, wherein steps (a) to (c) are repeated. 
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The metliod of claim 20, wherein the second oligonucleotide is a second oUgonucleotide 
P* which has a non-extendable 3' end, wherem the 3' non-exteadable terminus of the 
second oligomcleotide P* is removable by pyrophosphorolysis. 



25. The method of claim 24, wherein steps (a) to (c) are repeated. 

26. The method of claim 1 1, wherein tire 3 'non-extendable tennimis of first oligonucleotide 
P* is a nucleotide or nucleotide analog which cannot be extended by a nucleic acid 
polymerase but vMsh. can be removed by pyrophosphorolysis 

27. The method of claim 26, wherein the nucleotide or nucleotide analog is selected ftom the 
groiq) consisting of aS'deoxynucleotide, a 2',3'^iideoxyiiucleatide, an acyclonucleotide, 
3^-deoxyadenosine (cordycepin), B'-azido-S'-deoxythymidine (AZT), 2',3'-dideoxyinosine 
(ddl), 2',3'-dideoxy-3'-fhiacytidine (3TC) and 2',3'-didehydio-2',3'-dideoxyfhymidine 
(d4T). 

28. The me&od of claim 24, wherdn the S'noti-exteiidatole terminus of second 
oligonucleotide P* is a nucleotide or nucleotide analog which cannot be extraided by a 
nucleic acid polymerase but which can be removed by pyrophosphorolysis. 

29. The method of claira 28, wherem the nucleotide or nucleotide analog is selected from the 
group consisting of a 3'deoxynucleotide, a 2',3'-dideoxynucleotide, an acyclonucleotide, 
3'-4eoxyadenosine (cordycepin), 3'-a2ido-3'-deoxythymidine (AZT), 2',3'-dideoxyinosiQe 
(ddl), 2',3'-dideoxy-3'-thiacytidine (3TC) and 2',3'-didehydro-2',3'-did60xythymidine 
(d4T). 

30. The method of claim 24, wherein the 3 'non-extendable terminus of first oligonucleotide 
P* is a nucleotide or nucleotide analog which cannot be extended by a nucleic acid 
polymerase but which can be removed by pyrophosphorolysis and wherein the 3 'non- 
extendable terminus of the second oligonucleotide P* is a nucleotide or nucleotide 
analog which camot be extended by a nucleic acid polymerase but wiiich can be 
removed by pyrophosphorolysis. 
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3 1 . The mettiod of claim 3 0, whereia the nucleotide or nucleotide analog is selected from the 
group consisting of a 3'deoxyQucleotide, a 2',3'-dideoxyniicleotide, an acyclonucleotide, 
S'-deoxyadenosine (cordycepin), 3'-azidor3'-deoxythymidiae (AZT), 2',3'-dideoxyin.osine 
(ddl), 2',3'-dideoxy-3'-thiacytidiiie (3TC) and 2^3'-dideihydro-2^3'-dideoxyltymidi^ 

5 (d4T). 

32. A method of detecting a nucleic acid which comprises : 

(a) aunealiiig to a nucleic acid two oligonucleotide P*s, wherein each 
oligonucleotide P* has a non-extendable 3' end, wherein the 3 ' non-extendable termitms 

10 of each oligomicleotide P* is removable by pyrophosphorolysis, wherein one 

oligonucleotide P* overlaid with the ofher oligonucleotide P* by at least one nucleotide 
at their respective 3' ends, and wheieia one oligonucleotide P* anneals to a first nucleic 
acid strand and the other oligonucleotide P* anneals to a nucleic acid strand vMch is the 
cpmplemetit of the first nucleic acid strand; 

15 (b) removing the 3' non-extendable tesrraiims of the annealed first and second 

oligonucleotide P*s by pyrophosphorolysis to produce unblocked oligonucletides; 

(c) exteuding the unblocked oligonucleotides; and 

(d) detecting iJie extended oligonculeotides. 

20 33, Jhe method of cMm 32, wherein pyrophosphoroljrais is performed with a nucl^^ 
polymerase. 

34. The mefhod of claim 32, wherein extension is perfomied wilh a nucleic acid polymerase. 

25 35. The melhod of claim 32, wherein pyrophosphorolysis and earteBsion are perfomied with 
a nucleic acid polymerase. 

36. Tlie method of claim 32, wherein a nucleotide or nucleotide analog present in the 
extension step contains a label and the presence of a label in the exteaided 
30 oligonucleotide is detected. 



37. 



The method of claim 32, wherein steps (a), (b) and (c) are repeated. 
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38. The method of claim 32, wherein the 3 'non-extendable terminus of each oligonucleotide 
P* is a nucleotide or nucleotide analog which cannot be extended by a nucleic acid 
polyraerase but which can be removed by pjrophosphorolysis.. 

39. The method of claun 38, wherein the nucleotide or nucleotide analog is selected from the 
group consisting of a 3 'deoxynucleotide, a 2',3'-dideoxynucieotide, an acyclpnucleotide, 
3'-deoxyadenosine (cordycepin), 3'-azido-3'-deoxyfhymidine (AZT), 2',3'-dideoxyiiiosine 
(ddl), 2',3'-dideoxy-3'--thiacytidine (3TC) and 2',3'-didehydra-2',3'-dideoxyfhymidine 
(d4T). 

40. A method of nucleic acid detection which comprises: 

(a) synfliesizing a template micleic acid from the nucleic acid; 

(b) annealing to the template nucleic add a first oligonucleotide P* which has a 
non-extoadable 3' end, wherein the 3' non-extendable terminus of the first 
oligonucleotide P* is removable by pyrophosphorolysis; 

(c) removing the 3' non-extendable terminus of the first oligonucleotide P* 
annealed to the template nucleic acid by pyrophosphorolysis; and 

(d) extmding the unblocked oligonucleotide 

41. The mefliod of claim 40, wherein the template nucleic acid is synthesized by 
(i) annealing to one strand of the nucleic acid a first complcmentaiy ohgonucleotide that 
is sequence specific and extending said first complementary oligonucleotide on the 
nucleic acid strand with a nucleic acid polymerase to produce an extended first 
oligonucleotide and (ii) adding a second oligonucleotide to a 3' terminus of the extended 
first oligonucleotide to produce a twsphis! nucleic acid. 

42. The method of claim 41, wherein the second oligonucleotide is added to tiie ejcteoded 
first complementary oligonucleotide by ligation. 

43. The method of claim 41, wherein the second complementary oligonucleotide is added to 
the extended first complementary oligonucleotide by terminal transferase exteiosion. 
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44. The method of claim 41 which further comprises the step (al) amplifying the template 
nucleic acid. 

45. The method of claim 44, wherein tihe ampiificalion is a polymerase chain reactioa. 

46. The method of claim 44, wherein the amplification is by a pyrophosphorolysis activated 

polymerization. 

47. The method of claim 40, wherein the teoo^late nucleic acid is synthesized by a reaction 
which synthesizes cwpies of nucleic acid. 

48. The method of claim 47, wherein the amplification reaction is a polymerization chain 
reaction. 

49. The method of claim 47^ whaein tiie anq)lification reaction is a pyrophosphorolysis 
activated polymerization reaction. 

50. The method of clahn 40, wherein pyrophosphotolysis is performed with a nucleic add 
polymerase. 

51 . The method of claim 40, wharein extensionis performed with a nucleic acid polymerase. 

52. The method of claun 40, wherein pytophosphprolysis and extension are perfonned with 
a nucleic acid polymerase. 

53. The method of claim 52, wherein a nucleotide or nucleotide analog present in the 
extension step contains a label and the presence of ttie label in the extended 
oligonucleotide is detected, 

54. The method of claim 40, wherein steps (a) to (d) are repeated. 
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The method of claim 40, wherein the 3' non-extendable termini^ of oligomcleotide P* 
is a nucleotide or nucleotide analog which cannot be extended by a nucleic add 
polymerase but which can be removed by pyrophosphorolysis. 



56. The method of claim 55, wherein Qie nucleotide or nucleotide analog is selected fiom the 
group consisting of a 3'deoxynucleotide, a 2',3'-dideoxynucleotide, an acyclonucleotide, 
S'-deoxyadenosiae (cordycepin), 3'-azido-3'-deoxythymidiae (AZT), 2',3'-dideoxyinDsine 
(ddl), 2',3'-dideoxy-3'-thiacytidine (3TG) and 2',3'-didehydro-2',3'-dideoxythymidine 
(d4T). 

57. The method of claim 40, wheredn a second oligonucleotide is annealed to the template 
nucleic acid in step (b). 

58. The method of claim 57, wherein the second oligonucleotide is an unblocked second 
oligonucleotide which is extended in step (c-). 

59. The method of claim 57, whearein the second oligonucleotide is a second oligonucleotide 
P* which has a non-extendable 3' end, wherean the 3' non-extendable temainus of the 
second oligonucleotide P* is removed by pyrophosphorolysis in step (c) to produce an 
unblocked second oligonucleotide and wherdn the unblocked second oligapnucleotide is 
extended in step (cl). 



60. The method of claim 59, wherein the S'non-extendabie terminus of second 
oligonucleotide P* is a nucleotide or nucleotide analog which camnot be extended by a 

25 nucleic acid polymerase but which can be removed by pyrophosphoiolysis., 

61 . The method of claim 60, wherein the nucleotide or nucleotide analog is selected from the 
group consisting of a 3'deoxyiiucleotide, a 2',3 '-dideoxynucleotide, an acyclonucleotide, 
3'-deoxyadenosine (cordycepin), 3'-azido-3'-deoxyfhyxnidine (AZT), 2',3'-dideoxyinosine 

3 0 (ddl), 2',3'-dideoxy-3'"thiacytidine (3TC) and 2',3'-didehydro-2',3'-did60xy1hymidiae 

Cd4T). 
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The mefliod of daim 59, wheardn the 3 'non-extendable tenninus of second 
oligonucleotide P* is a nucleotide or nucleotide analog wHcli caonot be extended by a 
nucleic acid polymerase but which can be removed by pyropliosphorolysis,. 

The method of claim 62, wherein the nucleotide or nucleotide analog is selected from the 
group consisting of a 3'deoxynucleotide, a 2',3'-dideoxynucleotide, an acyclonucleotide, 
3'-deoxyadenosine (cordycepin), 3'-azido-3'-deoxythyniidine (AZT), 2',3'~dideoxytaoane 
(ddl), 2',3'-dideoxy-3'-thiacytidine (3TC) and 2',3'-didehydTO-2',3'-dideoxytliymidine 
(d4T). 

The method of claim 59, wherein the 3 'non-extendable terminus of first oligonucleotide 
P* is a nucleotide or nucleotide analog which cannot be extended by a nucleic acid 
polymerase but which can be removed by pyrophosphorolysis.. 

The method of claim 64, wherein the nucleotide or nucleotide analog is selected from the 
group consisting of aS'deoxytiucleotide, a2',3'H3ideoxynucleotide, an acyclonucleotide, 
3'-deoxyadenosine (cordycqjin), 3'-azido-3Vdeoxythymidine (AZT), 2',3'-dideoxyinosine 
(ddT), 2',3'-dideoxyv3'-thiacytidine (3TC) and 2^3'-didehydrp-2^3'-^deoxyfliymidine 
(d4T). 

The method of claim 40, wherein the micleic acid is a nucleic acid fliat has been cleaved. 

The method of claim 40, wbsscem the nucleic acid is a double stranded nucleic add that 
has a lesion or a nick. 

A pjTOphosphorolysis activated polymerization method of synthesizing a desired nucleic 
acid strand on a nuddc acid template strand which comprises, sedally 

(a) annealing to the template strand an activatable oligoiiucleotide P* that has a 
non-extendable 3'-deoxynudeotide at its 3' terminus and has a mismatch with respect to 

the corresponding nucleotide on the template strand, 

(b) pyrophosphorolyzing the resulting duplex with pyrophosphate and an enzyme 
that has pyrophosphonolysis activity and activates the oligonucleotide P* by removal of 
the terminal 3'-deoxynucle»tide, and 
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(c) polym6ri2iiig by extsndjng the activated oligonucleotid.e P* on the template 
strand in preseaice of four nucleoside triphosphate and a nucleic acid polymerase to 
synthesize the desired nucleic acid strand. 

69. The pyrophosphorolysis activated polymerization method of claLm 68 that includes 
anqjlification of the desired nucleic acid strand by 

(d) separating the desired nucleic acid strand of step (c) from the template strand 

and 

(e) repeating steps (a)-(d) until a desired level of amplification of the desired 
nucleic acid strand is achieved. 

70. The pyrophosphorolysis activated polymerization method of claim 69 carried out in the 
presence of a second oligonucleotide that in step (a) anneals to the separated desired 
nucleic acid strand product of st^ (d), and wherein step (c) includes polymerizing by 
extending tlie second oligonucleotide on the desired nucleic acid strand to synUiesize a 
copy of the nucleic acid template slrand, and step (d) includes separating the synthesized 
nucleic acid tenq)late strand from Ihe desired nucleic acid shmd, so that an^jlification is 
exponential. 

71. The pyrophosphorolysis activated polymerization method of claim 70 wherein a 
mismatch between ttie activatable oligonucleotide P* and the template stranfd occurs 
with respect to the corresponding nucleotide on ttie template. 

72. The pyrophosphorolysis activated polymerization method claim of 69 wherein steps (a) 
to (c) are conducted sequentially as two or more temperature stages on a ttiennocycler. 
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big: 26A 
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SEQUENCE LISTING 

<H0> City of Hope 
Liu, Qiang 
Sommer, Steve S. 
Bliggs, Arthur D. 

<120> Pyrophosphorolysis Activated Polymerization (PAP) 



<130> 1954-378-"II 

<150> OS 10/269,879 

<151> 2002-10-15 

<150> OS 09/789,556 

<151> 2001-02-22 

<150> OS 60/37 9,092 

<151> 2002-05-10 

<150> OS 60/237,180 

<151> 2000-10-03 

<1S0> US 60/187,035 

<151> 2000-03-06 

<1S0> OS 60/184,315 

<151> 2000-02-23 

<160> 72 

<170> Patentin version 3.0 

<210> 1 

<2H> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Amplification primer 

<400> 1 

gacctgcagc aagggagtca gaag 



<210> 2 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Amplification primer 

<400> 2 

tcataccgga aagggctgga gata 



<210> 3 
<211> 33 
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2 



<212> DNA 

<213> Homo sapiens 

<400> 3 

aatctgactg acccctattc cctgcttrgg aac 



<210> 4 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 

<400> 4 

actgacccct attccctgct t 



<210> 5 

<211> 22 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 
<220> 

<221> misc_feature 

<222> (22).. (22) 

<223> dideoxynucleotide 

<400> 5 

actgaccqct attccctgct tg 



<210> 6 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 

<220> 

<221> misc_f eature 

<222> (23).. (23) 

<223> dideoxynucleotide 

<400> 6 

actgacccct attccctgct tgg 



<210> 7 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 
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3 



<220> 

<221> misc__feature 

<222> (24).. (24) 

<223> dideoxynucleotide 

<400> 7 

actgacccct attccctgct tggg 



<210> 8 

<211> 25 

<212> Dm 

<213> Artificial 

<220> 

<223> Oligonucleotide 

<220> 

<221> ndsc_f6ature 

<222> (25).. (25) 

<223> dideoxynucleotide 

<400> 8 

actgacccct attccctgct tgggg 



<210> 9 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 
<220> 

<221> niiBC_feature 

<222> (26).. (26) 

<223> dideoxynucleotide 

<400> 9 

tctgactgac ccctattccc tgcttg 



<210> 10 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 

<220> 

<221> misc_feature 

<222> (24).. (24) 

<223> dideoxynucleotide 

<400> 10 

tgactgaccc ctattccctg ctta 
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4 



<210> 11 

<211> 24 

<212> DMA 

<213> Artificial 

<220> 

<223> Oligonucleotide 

<400> 11 

tcataccgga aagggctgga gata 



<210> 12 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 

<400> 12 

taggaacttg gggggtgtca gagccc 



<210> 13 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 

<400> 13 

tctgactgac ccctattccc tgcttg 



<210> 14 

<211> 22 

<212> DNA. 

<213> Artificial 

<220> 

<223> Oligonucleotide 
<220> 

<221> niiac_feature 

<222> (22).. (22) 

<223> dideoxynucleotide 

<400> 14 

actgacccct attccctgct tg 



<210> 15 

<211> 20 

<212> DNA 

<213> Artificial 
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<220> 

<223> Oligonucleotide 

<220> 

<221> misc_feature 

<222> (20} . . (20) 

<223> dideoxynucleotide 

<400> 15 

tgacccctat tccctgcttg 20 



<210> 16 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 

<220> 

<221> misG_feature 

<222> (18).. (18) 

<223> dideoxynucleotide 

<400> 16 

acccctattc cctgcttg 18 



<210> 17 

<211> 26 

<212> DNA 

<213> Artificial 

<220> - 

<223> Oligonucleotide 
<220> 

<221> misc_feature 

<222> (26) . . (26) 

<223> dideoxynucleotide 

<400> 17 

ctattccctg cttgggaact tgaggg 26 



<210> IB 

<211> 22 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 
<220> 

<221> misc_feature 

<222> (22).. (22) 

<223> dideoxynucleotide 



<400> 18 
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tccctgcttg ggaacttgag gg 



<210> 19 

<211> 20 

<212> Dm 

<213> Artificial 

<220> 

<223> Oligonucleotide 
<220> 

<221> i[iisc_f eature 

<222> (20).. (20) 

<223> dideoxynucleotide 

<400> 19 

cctgcttggg aacttgaggg 



<210> 20 • 

<211> 18 

<212> DNA 

<213> Artificial 

<22a> 

<223> Oligonucleotide 

<220> 

<221> mi sc_f eature 

<222> (18).. (18) 

<223> dideoxynucleotide 

<400> 20 

tgcttgggaa cttgaggg 



<210> 21 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 
<220> 

<221> inisc_feature 

<222> (26).. (26) 

<223> dideoxynucleotide 

<400> 21 

tgactgaccc ctattccctg cttagg 



<210> 22 

<211> 22 

<212> DNA 

<213> Artificial 
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7 . 

<220> 

<223> Oligonucleotide 
<220> 

<221> i[iisc_feature 

<222> (22).. (22) 

<223> dideoKynucleotide ' 

<400> 22 

tgacccctat tccctgctta gg 22 



<210> 23 

<211> 20 

<212> DBA 

<213> Artificial 

<220> 

<223> Oligonucleotide 
<220> 

<221> inisc_feature 

<222> (20).. (20} 

<223> dideoxynucleotide 

<400> 23 

accGctattc cctgcttagg 20 



<210> 24 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 
<22Q> 

<221> misc_feature 

<222> (18).. (18) 

<223> dideoxynucleotide 

<400> 24 

ccctattccc tgcttagg 18 



<210> 25 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 
<220> 

<221> misc_featur6 

<222> (26)..<26) 

<223> dideoxynucleotide 



<400> 25 
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8 



tgactgaccc ctattcgctg cttagg 



26 



<210> 26 

<211> 22 

<212> nm 

<213> Artificial 

<220> 

<223> Oligonucleotide 
<220> 

<221> niisc_feature 

<222> (22).. (22) 

<223> dideoxynucleotide 

<400> 26 

tgacccctat tcgctgctta gg 22 



<210> 27 

<211> 20 

<212> DNA 

<213>' Artificial 

<c220> ■ 

<223> Oligonucleotide 

<220> 

<221> iniac_feature 

<222> (20).. (20) 

<223> dideoxynucleotide 



<210> 28 

<2H> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 

<220> 

<221> misc_feature 

<222> (18).. (IB) 

<223> dideoxynucleotide 

<400> 28 

ccctattcgc tgcttagg ■ 18 



<210> 29 

<21l> 26 

<212> DNA 

<213> Artificial 



<4O0> 27 

acccctattc gctgcttagg 



20 
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<220> 

<223> Oligonucleotide 
<220> 

<221> iiiisc_feature 

<222> (26).. (26) 

<223> dideoxynucleotide 

<400> 29 

tgactgaccc ttattccctg cttagg 26 



<210> 30 

<211> 22 

<2I2> DNA 

<213> Artificial 



220> 

<223> Oligonucleotide 

<220> 

<221> ini3c_feature 

<222> (22} . . (22) 

<223> dideoxynucleotide 

<400> 30 

tgacccttat tocctgctta gg 22 



<210> 31 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 
<2203- 

<221> misc_feature 

<222> £20) . . (20) 

<223> dideoxynucleotide 

<400> 31 

acccttattc cctgcttagg 20 

<210> 32 

<211> IE 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 
<220> 

<221> inisc_feature 

<222> (18).. {18) 

<223> dideoxynucleotide 
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<400> 32 

ecttattccc tgcttagg 



<210> 33 

<211> 40 

<212> DNA 

<213> Homo sapiens 



<400> 33 

gactgacccc tattccctgc ttrggaactt gaggggtgtc 

<210> 34 

<211> 18 

<212> DNA 

<213> Artificial 



<220> 

<223> Oligonucleotide 
<220> 

<221> niisc_feature 

<222> (18).. (IS) 

<223> dideoHynucleotide 

<400> 34 

acccctattc ccfcgctta 18 



<210> 35 

<211> 17 

<212> DNA 

<213> Artificial 



<220> 

<223> Oligonucleotide - ■ 

<220> 

<221> laisc_feature 

<222> (18).. (18) 

<223> dideoxynucleotide 

<400> 35 

ttccctgctt gggaact 17 



<210> 36 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 
<220> 

<221> misG_feature 

<222> (18).. (18) 

<223> dideoxynucleotide 

<400> 36 
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Gcctgcttgg gaacttga 



<210> 37 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 

<220> 

<221> misc_feature 

<222> (18).. (18) 

<223> dideoxynucieotide 

<400> 37 

tgcttgggaa cttgaggg 



<210> 38 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 
<220> 

<221> niisc_feature 

<222> (18).. (18) 

<223> dideoxynucieotide 

<400> SB 

acccctattc cctgattg 



<21.0> 39 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 
<220> 

<221> misc_featujre 

<222> (18)., (13) 

<223> dideoxynucieotide 

<400> 39 

acccctattc cgtgcttg 



<210> 40 

<211> 18 

<212> DNA 

<213> Artificial 
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<220> 

<223> Oligonucleotide 
<220> 

<221> misc_featura 

<222> (18).. (18) 

<223> dideoxynucleotide 

<400> 40 

acccctatcc cctgcttg 18 



<210> 41 

<211> .18 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 
<220> 

<221> ittisc_feature 

<222> (18).. (18) 

<223> dideoxynucleotide 

<400> 41 

accccgattc cctgcttg ig 



<210> 42 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 
<220> 

<221> niisc_feature 

<222> (18).. (18) 

<223> dideoxynucleotide 

<400> 42 

actcctattc cctgcttg ig 

<210> 43 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 43 

tctgactgac ccctattccc tgctt 25 



<210> 44 

<211> 24 

<212> DMA 

<213> Artificj 
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<220> 

<223> Oligonucleotide 
<220> 

<221> misc_feature 

<222> (24)..<24) 

<223> dideoxynucleotide 

<400> 44 

tgactgacGC ctattccctg ctta 24 

<210> 45 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 
<220> 

<221> iuisc_feature 

<222> (26).. (26) 

<223> dideoxynucleotide 

<400> 45 

tctgactgac ccctattccc tgcttg 26 



<210> 46 

<211> 24 

<212> DKA 

<213> Artificial 

<22b> 

<223> Oligonucleotide 
<220> 

<221> misc__featttre 

<222> (24).. (24) 

<223> dideoxynucleotide 

<400> 46 ■ 

actgacccct attccctgct tggg 24 



<210> 47 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 
<220> 

<221> raisG_feature 

<222> (26) . . (26) 

<223> dideoxynucleotide 



<400> 47 
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14 



taggaacttg gggggtgtca gagccc 



26 



<210> 48 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 

<400> 48 

acggcagcac agaccagcgt gttc ■ • 24 



<210> 49 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 



<210> 50 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 

<400> 50 

gaagcaatct ggctgtgcaa agtg 24 



<210> 51 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 



<210> 52 

<211> 26 

<212> DHA 

<213> Artificial 

<220> 

<223> Oligonucleotide 



<400> 4 9 

ttgccactca agcggtcctc teat 



24 



<4D0> 51 

tctgactgac ccctattccc tgctta 



26 



<400> 52 
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tctgactgac ccctattccc tgcttg 



<210> 53 

<211> 26 

<212> DNa 

<213> Artificial 




<220> 

<223> Oligonucleotide 




<220> 

<221> misc feature 

<222> (26).. (26) 

<223> dideoxynucleotide 




<400> 53 

tctgactgac ccctattccc tgctta 


26 


<210> 54 

<211> 26 

<212> iMA 

<213> Artificial 




<220> 

<223> Oligonucleotide 




<220> 

<221> misc feature 

<222> (26) . . (26) 

<223> dideoxynucleotide 




<400> 54 

tctgactgac ccctattccc tgcttg 


26 


<210> 55 

<211> 20 

<212> DNA 

<213> Homo sapiens 




<400> 55 

cgagaagttt ttgaaaacac 


20 


<210> 56 

<211> 20 

<212> DM 

<213> Homo sapiens 




<400> 56 

gaacatacag agcaaaagcg 


20 



<210> 57 
<211> 30 
<212> DNA 
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<213> artificial 
<220> 

<223> Oligonucleotide 

<220> 

<221> misG_feature 

<222> (30).. (30) 

<223> acyclonucleotide 

<400> 57 

cgaagcctgt aaagcggcgg tgcacaatcg 



<210> 58 

<211> 25 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 

<400> 58 

actgttgatg ggtgtctggt cagag 



<210> 59 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 
<220> 

<221> mlsc_feature 

<222> (30).. (30) 

<223> acyclonucleotide 

<400> 59 

tgatcagccc actgacgcgt tgcgcgagac 



<210> 60 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 

<400> 60 

acaactggcg ggcaaacagt c 

<210> 61 

<211> 40 

<212> DNA 

<213> Artificial 



16 



30 



25 



30 
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<220> 

<223> Oligonucleotide 
<220> 

<221> iaisG_featurs 

<222> (40).. (40) 

<223> dideoxynucleotide 

<400> 61 

gatggcggag ctgaattaca ttcccaaccg cgtggcacat 



<210> 62 

<211> 40 

<212> Dm 

<213> Artificial 

<220> 

<223> Oligonucleotide 
<220> 

<221> misc_feature 

<222> (40).. (40) 

<223> dideoxynucleotide 

<400> 62 

ggcaacgcca atcagcaacg actgtttgcc cgccagttga 



<210> 63 

<211> 40 

<212> UHA 

<213> Artificial 

<220> 

<223> Oligonucleotide 
<220> 

<221> misc_feature 

<222> (40).. (40) 

<223> dideoxynucleotide 

<400> 63 

gaagcggcgt cgaagcctgt aaagcggcgg -tgcacaatcg 

<210> 64 

<2H> 42 

<212> DNfl. 

<213> Artificial 

<220> 

<223> Oligonucleotide 
<220> 

<221> misc^feature 

<222> (42) . . (42) 

<223> dideoxynucleotide 



<400> 64 
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gcggatagtt aatgatcagc ccactgacgc gttgcgcgag ac , 42 



<210> 


65 


<211> 


40 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Oligonucleotide 


<220> 




<221> 


misc feature 


<222> 


(40) . . (40) 


<223> 


dideoxynucleotide 


<400> 


65 



gaagcggcgt cgaagcctgt aaagcggcgg tgcacaatcc 40 



<210> 66 

<211> 42 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 

<220> 

<221> misc_feature 

<222> (42).. (42) 

<223> dideoxynucleotide 

<400> 66 

gcggatagtt aatgatcagc ccactgacgc gttgcgcgag ag 42 



<210> 


67 


<211> 


40 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Oligonucleotide 


<220> 




<221> 


misc feature 


<222> 


(40).. (40) 


<223> 


dideoxynucleotide 


<400> 


67 



gaagcggcct cgaagcctgt aaagcggcgg tgcacaatct 40 



<210> 68 

<211> 42 

<212> DNA 

<213> Artificial 
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<220> 

<223> Oligonucleotide 
<220> 

<221> misc_feature 

<222> {42].. {42} 

<223> dideoxynucleotide 

<400> 68 

gcggatagtt aatgatcagc ccactgacgc gfctgcgcgag aa 42 



<210> 


69 


<211> 


40 


<212> 


um. 


<213> 


Artificial 


<220> 




<223> 


Oligonucleotide 


<220> 




<221> 


misc feature 


<222> 


(40).. (40) 


<223> 


dideoxynucleotide 


<400> 


69 



gatggcggag ctgaattaca ttcccaaccg cgtggcacaa 40 



<210> 


70 


<211> 


40 


<212> 


DNA 


<213> 


Jirtificial 


<220> 




<223> 


Oligonucleotide 


<220> 




<221> 


misc feature 


<222> 


(40).. (40) 


<223> 


dideoxynucleotide 


<400> 


70 



ggcaacgcca atcagcaacg actgtttgcc cgcctattgt 40 



<210> 


71 


<211> 


40 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Oligonucleotide 


<220> 




<221> 


misc feature 


<222> 


(40).. (40) 


<223> 


dideoxynucleotide 


<400> 


71 
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tacattccca accgcgtggc acaacaactg gcgggcaaac 



<2iO> 72 

<211> 40 

<212> DNA 

<213> Artificial 

<220> • 

<223> Oligonucleotide 

<220> 

<221> misc_feature 

<222> MD)..(40) 

<223> dideoxynucleotide 

<400> 72 

gggccagact ggaggtggca acgccaatca gcaacgactg 



